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Preface 


Written primarily for students and research workers in the area of the 
behavioral sciences, this book is meant to provide a text and comprehensive 
reference source on statistical principles underlying experimental design. 
Particular emphasis is given to those designs that are likely to prove useful 
in research in the behavioral sciences. 

The book primarily emphasizes the logical basis of principles underlying 
designs for experiments rather than mathematical derivations associated 
with relevant sampling distributions. The topics selected for inclusion 
are those covered in courses taught by the author during the past several 
years. 

Students in these courses have widely varying backgrounds in mathe- 
matics and come primarily from the fields of psychology, education, 
economics, sociology, and industrial engineering. Ithas been the intention 
of the author to keep the book at a readability level appropriate for 
students having a mathematical background equivalent to freshman college 
algebra. From experience with those sections of the book which have 
been used as text material in dittoed form, there is evidence to indicate 
that, in large measure, the desired readability level has been attained. 
Admittedly, however, there are some sections in the book where this 
readability goal has not been achieved. 

The first course in design, as taught by the author, has as a prerequisite 
a basic course in statistical inference. The contents of Chaps. | and 2 
review the highlights of what is included in the prerequisite material. 
These chapters are not meant to provide the reader with a first exposure 
to these topics. They are intended to provide a review of terminology 
and notation for the concepts which are more fully developed in later 
chapters. 

By no means is all the material included in the book covered in a one- 
semester course. In a course of this length, the author has included 
Chaps. 3, 4, parts of 5, 6, parts of 7, parts of 10, and parts of 11. Chapters 
8 through 11 were written to be somewhat independent of each other. 
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Hence one may read, with understanding, in these chapters without undue 
reference to material in the others. 

In general, the discussion of principles, interpretations of illustrative 
examples, and computational procedures are included in successive sections 
within the same chapter. However, to facilitate the use of the book as a 
reference source, this procedure is not followed in Chaps. 5 and 6. Basic 
principles associated with a large class of designs for factorial experiments 
are discussed in Chap. 5. Detailed illustrative examples of these designs 
are presented in Chap. 6. For teaching purposes, the author includes 
relevant material from Chap. 6 with the corresponding material in Chap. 5. 

Selected topics from Chaps. 7 through 11 have formed the basis for a 
second course in experimental design. 

Relatively complete tables for sampling distributions of statistics used 
in the analysis of experimental designs are included in the Appendix. 
Ample references to source materials having mathematical proofs for the 
principles stated in the text are provided. 

The author is indebted to E. S. Pearson and the trustees of Biometrika 
for permission to reproduce parts of Tables B.1, B.3, B.7, and B.9 from 
Biometrika Tables for Statisticians, vol. І, 2d ed. The author is indebted 
to H. L. Harter, D. S. Clem, and E. Н. Guthrie for permission to reproduce 
Table B.4, which was taken from WADC Technical Report 58-484, vol. 
П, 1959. The author is indebted to С. W. Dunnett and the editor of the 
Journal of the American Statistical Association for permission to reprint 
Table B.6. The author is also indebted to C. Eisenhart, M. W. Hastay, 
and W. A. Wallis for permission to reprint Table B.8, which appears in 
Techniques of Statistical Analysis, 1947. The author is also indebted. to 
L. S. Feldt and M. W. Mahmoud as well as the editor of Psychometrika 
for permission to reprint Table B.11. 

Special thanks are due to Mrs. G. P. Lehman and Mrs. R. L. Smith for 
excellent secretarial assistance in preparing the manuscript. 

The author is particularly grateful to Dr. D. A. Wood for many reasons, 
and to Dr. A. Lubin, whose critical reading of the manuscript did much 
to help the author prepare the present version of this book. 

B. J. Winer 
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Introduction 


The design of an experiment may be compared to an architect’s plans for 
a structure, whether it be a giant skyscraper or a modest home. The basic 
requirements for the structure are given to the architect by the prospective 
owner. It is the architect's task to fill these basic requirements; yet the 
architect has ample room for exercising his ingenuity. Several different 
plans may be drawn up to meet all the basic requirements. Some plans 
may be more costly than others; given two plans having the same cost, one 
may offer potential advantages that the second does not. 

In the design of an experiment, the designer has the role of the architect, 
the experimenter the role of the prospective owner. These two roles are not 
necessarily mutually exclusive—the experimenter may do a considerable 
portion of the design work. The basic requirements апа primary objectives 
of the experiment are formulated by the experimenter; the experimenter may 
or may not be aware of the possible alternative approaches that can be 
followed in the conduct of his experiment. It is the designer’s function to 
make the experimenter aware of these alternatives and to indicate the poten- 
tial advantages and disadvantages of each of the alternative approaches, 
It is, however, the experimenter’s task to reach the final decision about the 
conduct of the experiment. 

The individual best qualified to design an experiment is the one who is 
(1) most familiar with the nature of the experimental material, (2) most 
familiar with the possible alternative methods for designing the experiment, 
(3) most capable of evaluating the potential advantages and disadvantages of 
the alternatives. Where an individual possesses all these qualifications, the 
roles of experimenter and designer are опе. On some research problems in 
many experimental fields, the experimenter is capable of making all the 
necessary decisions without seeking extensive assistance. On more com- 
plex research problems, the experimenter may turn to colleagues who are 
equally or more familiar with the subject-matter area for assistance in for- 
mulating the basic requirements and primary objectives of his experiment. 
Problems on the design of the experiment may also be discussed with the 
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subject-matter specialist, and considerable assistance on design problems 
may be obtained from this source. Тһе experimenter may also turn to the 
individual whose specialized training is in the area of experimental design, 
just as the prospective builder turns to the architect for assistance on design 
problems. If the designer is familiar with the nature of the experimental 
material and the outcome of past experimentation in the general area of the 
experiment, he is іп a better position to assist the experimenter in evaluating 
the possible choices as well as to suggest feasible alternative choices. 

In the design of experiments there is ample opportunity for ingenuity in 
the method of attacking the basic problems. Two experiments having 
identical objectives may be designed in quite different ways: at the same cost 
in terms of experimental effort, one design may lead to unambiguous results 
no matter what the outcome, whereas the second design could potentially 
lead to ambiguous results no matter what the outcome. How good one 
design is relative to a second for handling the same general objective may be 
measured (1) in terms of the relative cost of the experimental effort and (2) 
in terms of the relative precision with which conclusions may be stated. 
More precise conclusions do not always demand the greater experimental 
effort, but they generally do demand more careful attention to experimental 
design. 

Without an adequate experimental design, potentially fruitful hypotheses 
cannot be tested with any acceptable degree of precision. Before rejecting 
a hypothesis in a research field, one should examine the structure of the 
experiment to ascertain whether or not the experiment provided a real test 
of the hypothesis. On the other hand the most carefully planned experi- 
ment will not compensate for the lack of a fruitful hypothesis to be tested. 
In the latter case, the end product of this well-designed experiment can yield 
only relatively trivial results, 

One of the primary objectives of this book is to provide the prospective 
experimenter with some of the basic principles used in the construction of 
experimental designs. These principles apply in all areas of experimental 
work. By the use of these principles, an extensive collection of relatively 
standard designs have been constructed to handle problems in design that 
have been encountered їп а variety of experiments. These standard designs 
will be considered in detail, and their potential applications in research areas 
in the behavioral sciences will be indicated, Seldom does an experimenter 
have an experiment that is a perfect fit to a standard design. Some modifi- 
cation is frequently required; this is particularly true in experimental work 
in the area of the behavioral sciences. Careful planning by both the experi- 
menter and the designer is often required in order to cast an experiment ina 
form that will permit the utilization of a standard design or to modify 
standard designs in a manner that will more closely meet the requirements of 
the experiment. 


Principles of experimental design have their roots primarily in the logic of 
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scientific method. Indeed logicians have made substantial contributions to 
the principles of experimental design. The steps from logic to mathematics 
are small ones. The now classic work on the basic statistical principles 
underlying experimental design is R. A. Fisher’s The Design of Experiments. 
This work includes more than purely mathematical arguments—it probes 
into the basic logical structure of experiments and examines the manner in 
which experiments can provide information about problems put to experi- 
mental test. Depending upon how the experiment is conducted, it may or 
may not provide information about the issues at question. What has 
become standard working equipment for the individuals specializing in the 
area of experimental design stems in large measure from this and other works 
of R. A. Fisher. 

What is perhaps the equivalent of a master collection of architect’s plans 
is to be found in the work Experimental Designs by W. G. Cochran and 
G. M. Cox. This work is more than a mere collection of designs, It is a 
carefully prepared and well-organized text and reference book. Illustrative 
material is drawn from many different research areas, although most of the 
material is from the field of agriculture. 

The statistical theory underlying major aspects of experimental design is 
by no means complete. The current literature in the area is extensive. 


CHAPTER 1 


Basic Concepts in Statistical Inference 


1.1 Basic Terminology in Sampling 


A statistical population is the collection of all elements about which one 
seeks information. Only a relatively small fraction, or sample, of the total 
number of elements in a statistical population can generally be observed. 
From data on the elements that are observed, conclusions or inferences are 
drawn about the characteristics of the entire population. In order to dis- 
tinguish between quantities computed from observed data and quantities 
which characterize the population, the term statistic will be used to desi gnate 
aquantity computed from sample data, and the term parameter will be used 
to designate a quantity characteristic of a population. Statistics are com- 
puted from sample data for two purposes: (1) to describe the data obtained 
in the sample, and (2) to estimate or test hypotheses about characteristics of 
the population. 

If all the elements іп a statistical population were measured on a charac- 
teristic of interest, and if the measurements were then tabulated in the form 
of a frequency distribution, the result would be the population distribution 
for the characteristic measured. A description of the population distribu- 
tion is made in terms of parameters. The number of parameters necessary 
to describe the population depends on the form of the frequency distribution. 
If the form is that of the normal distribution, two parameters will completely 
describe the frequency distribution—the population mean, designated y,and 
the population standard deviation, designated с. Ifthe form is not normal, 
the mean and the standard deviation may not be sufficient to describe the 
distribution. Indeed these two parameters may provide relatively little 
information about the distribution; other parameters may be required. 

The sample mean, designated Y, generally provides an estimate of the 
population mean ш. In these same cases, the sample standard deviation, 
designated s, generally provides an estimate of the population standard 
deviation о. The accuracy, or precision, of estimates of this kind depends 
upon the size of the sample from which such estimates are computed, the 
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manner in which the sample was drawn from the population, the сһагас- 
teristics of the population from which the sample was drawn, and the formula 
used to estimate the parameter. 

If a sample is drawn in such a way that (1) all elements in the population 
have an equal and constant chance of being drawn on all draws and (2) all 
possible samples have an equal (or a fixed and determinable) chance of being 
drawn, the resulting sample is a random sample from the specified popula- 
tion. By no means should a random sample be considered a haphazard, 
unplanned sample. Numerous other methods exist for drawing samples, 
Random samples have properties which are particularly important in 
statistical work. This importance stems from the fact that random sam- 
pling ensures constant and independent probabilities; the latter are relatively 
simple to handle mathematically. 

Suppose that one were to draw a large number of samples (say, 100,000), 
each having п elements, from a specified population. Suppose further that 
the procedures by which the samples are drawn are comparable for all 
samples. For each of the samples drawn, suppose that the sample mean Y 
and the sample variance s* are computed. The frequency distribution of 
the X’s defines operationally what is meant by the sampling distribution of 
the sample mean. A distribution constructed in this way provides an 
empirically determined sampling distribution for the mean, The frequency 
distribution of the sample variances would provide an empirically deter- 
mined sampling distribution for the variance. The sampling distribution 
of a statistic depends, in part, upon the way in which the samples are drawn. 

Sampling distributions of statistics are generally tabulated in terms of 
cumulative frequencies, relative frequencies, or probabilities. The charac- 
teristics of sampling distributions are also described by parameters. Fre- 
quently the parameters of sampling distributions are related to the 
parameters of the population from which the samples are drawn. The mean 
of the sampling distribution is called the expected value of the statistic. The 
standard deviation of the sampling distribution is called the standard error 
of the statistic. The form of the sampling distribution as well as the magni- 
tude of its parameters depends upon (1) the distribution of the measurements 
in the basic population from which the sample was drawn, (2) the sampling 
plan followed in drawing the samples, and (3) the number of elements in the 
sample. 

Suppose that the basic population from which sample elements are 
drawn can be considered to be approximately normal in form, with mean 
equal to some value и, and with standard deviation equal to some value ø. 
In other words, the frequency distribution of the measurements of interest 
is approximately normal in form, with specified values for the parameters. 
A normal distribution having a mean equal to и and a standard deviation 
equal to c is designated by М(и,о). If one were to draw a large number of 
random samples of size п from a population in which the measurements 
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have the approximate form М(и,е), the sampling distribution of the statistic 
X would be approximately normal in form, with expected value approxi- 


mately equal to и, and with standard error approximately equal to o/ Vn. 
Thus the sampling distribution of the mean of random samples of size n 
from the approximate population N(u,s) would be approximately 


N(u,o] Vn). This result may be verified by empirical sampling experiments. 

The sampling distribution of the statistic Y, assuming random sampling 
from the exact population №(и,о), сап be derived mathematically from the 
properties of random samples; from purely mathematical considerations 


it can be shown that this sampling distribution is exactly N(u,o/ V n). 
Herein lies the importance of random samples—they have properties which 
permit the estimation of sampling distributions from purely mathematical 
considerations without the necessity for obtaining empirical sampling dis- 
tributions. Estimates obtained from such samples have highly desirable 
properties—the latter will be discussed in a later section. Such purely 
mathematical considerations lead to scientifically useful results only when 
the experimental procedures adequately conform to the mathematical 
models used in predicting experimental results. Also, from purely mathe- 
matical considerations, it can be shown that the statistic (n — 1)s*/o* will 
have a sampling distribution that corresponds to the chi-square distribution 
which has n — 1 degrees of freedom. This last prediction may also be 
verified by sampling experiments. 

If the population distribution is only approximately normal in form, the 
mathematical sampling distributions just discussed provide approximations 
to their operational counterparts; the larger the sample size, the better the 
approximation. One of the basic theorems in sampling theory, the 
central-limit theorem, states that the sampling distribution of the means of 
random samples will be approximately normal in form regardless of the 
form of the distribution in the population, provided that the sample size is 
sufficiently large and provided that the population variance is finite. The 
more the population distribution differs from a bell-shaped distribution, 
the larger the sample size must be for the theorem to hold, 

Statistics obtained from samples drawn bymeans of sampling plans which 
are not random have sampling distributions which are either unknown or 
which can only be approximated with unknown precision. Good approxi- 
mations to sampling distributions of statistics are required if one is to 
evaluate the precision of the inferences made from sample data. 


1.2 Basic Terminology in Statistical Estimation 


Numerical values of parameters can be computed directly from observed 


data only when measurements on all elements in the population are available. 
Generally a parameter is estimate 


er is d from statistics based upon one or more 
Samples. Several criteria are used to evaluate how good a statistic is as an 
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estimate of a parameter. One such criterion is lack of bias. А statistic is 
an unbiased estimate of a parameter if the expected value of the sampling 
distribution of the statistic is equal to the parameter of which it is an esti- 
mate. Thus the concept of unbiasedness is a property of the sampling 
distribution and not strictly a property of a single statistic. When one says 
that a given statistic is an unbiased estimate of a parameter, what one implies 
is that in the long run the mean of such statistics computed from a large 
number of samples of equal size will be equal to the parameter. 

The mean X of a random sample from a normal population is an unbiased 
estimate of the population mean because the sampling distribution of X 
has an expected value equal to и. Suppose that a random sample of size 
n is drawn from a specified normal population; suppose that the mean of 
thissampleis45. Then 45 is an unbiased estimate of the population mean. 
Suppose that a second random sample of size n is drawn from the same 
population; suppose that the mean of the second sample is 55. Then 55 is 
also an unbiased estimate of the population mean. Thus two random 
samples provide two unbiased estimates of the population mean; these 
estimates will not, in general, be equal to one another. There is no way of 
deciding which one, considered by itself, is the better estimate. The best 
single estimate of the population mean, given the two samples, is the average 
of the two sample means. This average is also an unbiased estimate of the 
population mean. Itis а better estimate of jin the sense that it has greater 
precision. 

The precision of an estimator is generally measured by the standard error 
of its sampling distribution. The smaller the standard error, the greater the 
precision. Of two unbiased estimators whose sampling distributions have 
the same form, the better estimator is the one having the smaller standard 
error. The standard error of a sampling distribution is a good index of the 
precision only in those cases in which the form of the distribution approaches 
the normal distribution as the sample size increases. For statistics whose 
sampling distribution has this property, the best unbiased estimator is 
defined to be the one having the smallest standard error. The efficiency of 
an unbiased estimator is measured relative to the square of the standard 
error of the best unbiased estimator. For example, if the squared standard 
error of one unbiased estimator is o?/n and the squared standard error of the 
best unbiased estimator is c?/27, then the efficiency of the first estimator is 
defined to be 


The concept of consistency in an estimator is in a sense related to that of 
unbiasedness. An estimator is a consistent estimate of a parameter if the 
probability that it differs from the parameter by any amount approaches 
zero as the sample size increases. Іп other words, a statistic is a consistent 
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estimator if the bias tends toward zero as the sample size increases. An 
unbiased estimator is a consistent estimator. On the other hand, a con- 
sistent estimator may be biased for small samples. 

Properties of estimators which hold as the sample size increases are called 
asymptotic properties. How large the sample size must be before asymp- 
totic properties can be reasonably expected to hold varies as a function of the 
characteristics of the population and the method of sampling being used. 
Consistent estimators are asymptotically unbiased estimators. Where 
the bias of a consistent estimator is low but its precision is high, the con- 
sistent statistic may be used in preference to an unbiased estimator having 
less precision. 

A parameter is, in most cases, a number. It may be estimated by a num- 
ber, called a point estimate of the parameter. Another way of estimatin ga 
parameter is to specify a range of numbers, or an interval, within which the 
parameter lies. This latter type of estimate is known as an interval estimate 
ofthe parameter. The difference between the largest and smallest numbers 
of the interval estimate defines the range, or width, of the interval, The 
sampling distribution of a statistic obtained by means of purely mathe- 
matical considerations will provide information about the relative Б requency 
(probability) of statistics ina given interval. Probabilities obtained directly 
from such sampling distributions provide predictions about the relative 
frequency with which statistics of given magnitudes will occur, assuming 
that conditions specified in the mathematical derivation are true in the 
population. Thus knowledge of sampling distributions permits one to 
argue from a specified population to Consequences in a series of samples 
drawn from this population. 

In statistical estimation, the objective is to obtain estimates of the 
parameters in the population, given the observations in the sample. The 
parameters are unknown. Given the magnitude of certain statistics com- 
puted from the observed data, from which of several possible alternative 
populations was this sample drawn? Concepts of likelihood, confidence, 
inverse probability, and fiducial probability are used by some statisticians 
to evaluate the answer to this last question. This question can be rephrased 
in terms of two of these concepts. 

à 1. Given a sample, what is the likelihood that it was drawn from a popula- 
tion having a specified set of parameters ? 

2. Given asample, with what confidence can it be said that the population 
from which it was drawn has a specified parameter within a given range? 

The likelihood of obtaining a given sample is the probability of obtaining 
the sample asa function of different values of the parameters underlying 
the population. Admittedly there is only a single set of parameters under- 
lying a specified population. These values are, however, unknown. The 
relative frequency with which certain samples will occur depends upon 
the true values of these parameters. Under one set of assumptions about 
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the parameter values, a given sample may have very high probability of 
occurring, whereas under a second set of assumptions the probability of the 
occurrence of a given sample may be very low. 

R. A. Fisher introduced a widely used principle in statistical estimation: 
one selects as an estimator of a parameter that value which will maximize 
the likelihood of the sample that is actually observed to occur. Estimators 
having this property are known as maximum-likelihood estimators. In 
many areas of statistics, the principle of maximum likelihood provides 
estimators having maximum precision (i.e., minimum standard error). 

An interval estimate is frequently referred to as a confidence interval for a 
parameter. The two extreme points in this interval, the upper and lower 
confidence bounds, define a range of values within which there is a specified 
likelihood (or level of confidence) that that parameter will fall. Given 
information from a single sample, the parameter either does or does not lie 
within this range. The procedure by which the upper and lower confidence 
bounds are determined will, in the long run (if the study is repeated many 
times) ensure that the proportion of correct statements is equal to the level 
of confidence for the interval. The numerical values of the upper and 
lower confidence bounds change from sample to sample, since these bounds 
depend in part upon statistics computed from the samples. 

An interval estimate of a parameter provides information about the pre- 
cision of the estimate; a point estimate does not include such information. 
The principles underlying interval estimation for a parameter are closely 
related to the principles underlying tests of statistical hypotheses. 


1.3 Basic Terminology in Testing Statistical Hypotheses 


A statistical hypothesisis a statement about a statistical population which, 
on the basis of information obtained from observed data, one seeks to 
supportorrefute. A statistical test isaset of rules whereby a decision about 
the hypothesis is reached. Associated with the decision rules is some indi- 
cation of the accuracy of the decisions reached by following the rules. The 
measure of the accuracy is a probability statement about making the correct 
decision when various conditions are true in the population in which the 
hypothesis applies. 

The design of an experiment has a great deal to do with the accuracy of the 
decisions based upon information supplied Бу an experiment. The decision 
rules depend in part upon what the experimenter considers critical bounds 
on arriving at the wrong decision. However, a statistical hypothesis does 
not become false when it exceeds such critical bounds, nor does the hypoth- 
esis become true when it does not exceed such bounds. Decision rules are 
guides in summarizing the results of a statistical test—following such guides 
enables the experimenter to attach probability statements to his decisions. 
In evaluating the outcome of a single experiment or in using the information 
ina single experiment as a basis for a course of action, whether an outcome 
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exceeds an arbitrary critical value may or may not be relevant to the issue 
athand. Probability statements that are associated with decision rules ina 
statistical test are predictions as to what may be expected to be the case if the 
conditions of the experiment were repeated a large number of times. 

The logic of tests on statistical hypotheses is as follows: One assumes that 
the hypothesis that one desires to test is true. Then one examines the con- 
sequences of this assumption in terms of a sampling distribution which 
depends upon the truth of this hypothesis. If, as determined from the 
sampling distribution, observed data have relatively high probability of 
occurring, the decision is made that the data do not contradict the hypoth- 
esis. On the other hand, if the probability of an observed set of data is 
relatively low when the hypothesis is true, the decision is that the data tend 
to contradict the hypothesis. Frequently the hypothesis that is tested is 
stated in such a way that, when the data tend to contradict it, the experi- 
menter is actually demonstrating what it is that he is trying to establish. 
In such cases the experimenter is interested in being able to reject or nullify 
the hypothesis being tested. 

The level of significance of a statistical test defines the probability level that 
is to be considered too low to warrant support of the hypothesis being tested. 
If the probability of the occurrence of observed data (when the hypoth- 
esis being tested is true) is smaller than the level of significance, then the data 
are said to contradict the hypothesis being tested, and a decision is made to 
reject this hypothesis. Rejection of the hypothesis being tested is equivalent 
to supporting one of the possible alternative hypotheses which are not con- 
tradicted. 

The hypothesis being tested will be designated by the symbol H,. (In 
some notation systems this hypothesis has been designated by the symbol 
Hy.) The set of hypotheses that remain tenable when i, is rejected will be 
called the alternative hypothesis and will be designated by the symbol Hy. 
The decision rules in a statistical test are with respect to the rejection or non- 
rejection of Ні. The rejection of H, may be regarded as a decision to accept 
Hz; the nonrejection of H, may be regarded as a decision against the accept- 
ance of Hy. If the decision rules reject H, when in fact Н, is true, the rules 
lead to an erroneous decision. The probability of making this kind of 
error 1s at most equal to the level of significance of the test. Thus the level 
of significance sets an upper bound on the probability of making a decision. 
to reject Н, when in fact H, is true. This kind of erroneous decision is 


known asa type 1 error; the robability of makinga t i d 
рате р y ing a type 1 erroris controlle 


If the decision rules do not reject А, 
hypotheses is true, the rules also lead to 
of error is known as a type 2 error. 
error depends in part upon the level о 
one of the possible alternative hypoth 


when in fact one of the alternative 
an erroneous decision. This kind 
The potential magnitude of a type 2 
f significance and in part upon which 
eses actually is true. Associated with 
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each of the possible alternative hypotheses is a type 2 error of a different 
magnitude, The magnitude of a type 1 error is designated by the symbol 
a, and the magnitude of the type 2 error for a specified alternative hypothesis 
is designated by the symbol £. The definitions of type 1 and type 2 errors 
may be summarized as follows: 


State of affairs in the population 


| 
Decision ONDE (ж 
EEU fe 
2 
Reject H, " E г : | Е Ге 
Ada Hy | Type 1 error (о) | No error 
= — | & LU peni. 
Do not reject Hy | No error | Туре 2 error (f) 


Do not accept Н» 


In this summary, rejection of H; is regarded as being equivalent to accepting 
H, and nonrejection of H, equivalent to not accepting Hy. Тһе possibility 
of a type 1 error exists only when the decision is to reject H,; the possibility 
of a type 2 error exists only when the decision is not to reject Hy. 

The experimenter has the level of significance (type | error) directly under 
his control. Type 2 error is controlled indirectly, primarily through the 
design of the experiment. If possible, the hypothesis to be tested is stated 
in such a way that the more costly error is type | error. It is desirable to 
have both types of error small. However, the two types of error are not 
independent—the smaller numerically the type | error, the larger numerically 
the potential type 2 error. 

To see the relationship between the two types of error, consider Fig. 1.1. 
In part a of this figure the left-hand curve represents the sampling distribu- 
tion of a relevant statistic when Н, is true, and the right-hand curve repre- 
sents the sampling distribution of the same statistic when a particular H, is 
true. The region of rejection of H; is defined with reference to the sampling 
distribution which assumes that Н, is true. Тһе decision rules specify that 
Н, is to be rejected if an observed statistic has any value in the region of 
rejection. The probability of a statistic’s falling in this region is equal to « 
when Н, is true. The type 2 error associated with the particular H, repre- 
sented in part a is numerically equal to the area under the right-hand curve 
which falls in the region of nonrejection of №. 

In part b the numerical value of х is smaller than that in part а. This 
means that the decision rule has smaller type 1 error. Тһе area under the 
right-hand curve in part b that falls in the region of nonrejection of H, is 
larger than the corresponding area in part a. Decreasing the numerical 
value of the type 1 error (level of significance) will increase the potential 
magnitude of the type 2 error. 
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The power of a test with respect to a specified alternative hypothesis is 
numerically equal to 1 minus the probability ofatype2error. Represented 
geometrically, the power of a test is the area of the sampling distribution, 
when Н» is true, that falls in the region of rejection of Ну. In part a of the 
figure this is the area under the right-hand curve that is to the right of the 
critical value. The power of a test decreases as the numerical value of « 
decreases. 


The power of a test may be defined symbolically as 


Power = P(decision rejects Н, | H, true). 


In words, power is the probability that the decision rule rejects H, when a 
specified H, is true. Each of the possible hypotheses іп Н» has its own 


Region of nonrejection | Region of rejection Region of nonrejection Region of rejection 
of Hy of H; of H; of H; 


Critical 
value 


Critical 
value 


(а) (b) 


Figure 1.1 


power. The closer an alternative hypothesis is to Н,, that is, the greater 
the overlap of the corresponding sampling distributions, the lower will be 
the power of the test with respect to that alternative. A well-designed 
experiment will have relatively high power with respect to all alternatives 
which are different in a practical sense from Ну. For example, if Н, states 
that there is zero difference between two means, then one of the possible 
alternative hypotheses is that the difference is 001 unit. For all practical 
purposes this alternative may not be different from Н,; hence power with 
Tespect to this alternative need not be of concern to the experimenter. How- 
ever, an alternative hypothesis which states that the difference is 5 units may 
have practically important consequences if true. Power with respect to this 
alternative would be a matter of concern to the experimenter. 

In research in the area of the behavioral sciences, it is often difficult to 
evaluate the relative costs of type | and type 2 in terms of meaningful units. 
Both kinds of errors may be equally important, particularly in exploratory 
work, Too much emphasis has been placed upon the level of significance 
of a test and far too little emphasis upon the power of the test. In many 
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cases where H, is not rejected, were the power of such tests studied carefully, 
the decisions might more appropriately have been that the experiment did 
not really provide an adequately sensitive (powerful) test of the hypothesis. 

No absolute standards can be set up for determining the appropriate 
level of significance and power that а test should һауе. The level of signif- 
icance used in making statistical tests should be gauged in part by the power 
of practically important alternative hypotheses at varying levels of signifi- 
cance. If experiments were conducted in the best of all possible worlds, the 
design of the experiment would provide adequate power for any predeter- 
mined level of significance that the experimenter were to set. However, 
experiments are conducted under the conditions that exist within the world 
in which one lives. What is needed to attain the demands of the well- 
designed experiment may not be realized. The experimenter must be satis- 
fied with the best design feasible within the restrictions imposed by the 
working conditions. The frequent use of the .05 and .01 levels of signifi- 
cance is a matter of a convention having little scientific or logical basis. 
When the power of tests is likely to be low under these levels of significance, 
and when type 1 and type 2 errors are of approximately equal importance, 
the .30 and .20 levels of significance may be more appropriate than the .05 
and .01 levels. 

The evidence provided by a single experiment with respect to the truth or 
falsity of a statistical hypothesis is seldom complete enough to arrive at a 
decision which is free of all possible error. The potential risks in decisions 
based upon experimental evidence may in most cases be evaluated. What 
the magnitude of the risks should be before one takes a specified action in 
each case will depend upon existing conditions. The data from the statis- 
tical test will provide likelihoods associated with various actions. 


CHAPTER 2 


Testing Hypotheses about Means 
and Variances 


2.1 Testing Hypotheses on Means—o Assumed Known 


To illustrate the basic procedures for making a statistical test, a highly 
simplified example will be used. Suppose that experience has shown that 
the form of the distribution of measurements on a characteristic of interest 
in a specified population is approximately normal. Further suppose, given 
data on a random sample of size 25 from this population, that information 
about the population mean џ is desired. Іп particular the experimenter is 
interested in finding out whether or not the data support the hypothesis that 
n às greater than 50. 

The first step in the test is to formulate H, and Hy. Suppose that an 
erroneous decision to reject the hypothesis that the population mean is 50 
is more costly than an erroneous decision to reject the hypothesis that the 
mean is greater than 50. In this case H; is chosen to be н = 50; this choice 
for H, makes the more costly type of error the type | error, which is under the 
direct control of the experimenter. The alternative hypothesis in this case 
isu > 50. The decision rules for this test are to be formulated in such a way 
that rejection of Н, is to provide evidence in favor of the tenability of Ha. 

The choice for H, could also be и < 50. However, if the data tend to 
reject the hypothesis that и = 50 and support the hypothesis that > 50, 
then the data will also tend to reject the hypothesis that и < 50. Thus, in 
formulating a decision rule which rejects И, only when the data support the 
hypothesis that / > 50, only the hypothesis that и = 50 need be considered. 
In essence the case и < 50 is irrelevant (inadmissible) in formulating the 
decision rule. However, nonrejection of H, would imply ш < 50. 

When it is true that и = 50, the sampling distribution of the mean of 
random samples from a normal population is normal in form, with expected 
value equal to 50 and standard error equal to the population standard 
deviation divided by the square root of the sample size. In practice the 
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value of the population standard deviation will not be known, but to keep 
this example simple, suppose that the population standard deviation o is 
equal to 10. Then the standard error of the sampling distribution of the 
mean for samples of size 25 is o/Vn = 10/25 = 2. Decision rules must 
now be formulated to indicate when observed data are consistent with Н, 
and when observed data are not consistent with H,. The decision rules 
must indicate a range of potentially observable values of X for which the 
decision will be to reject H,. This range of values of X will be called the 
region of rejection for H,. Тһе probability of observing an X in this region 
is to beat most equal to the level of significance of the test, i.e., the magnitude 
ofthe type l error. This sets an upper bound on the probability of reaching 


(1) (2) 


os. 


Figure 2.1 


the wrong decision when Н, is true. Іп addition to satisfying this condition 
with respect to type 1 error, the region of rejection for /7, must have rela- 
tively high probability for the observed Ж when H, is true. Hence the 
decision rules must specify a range of values of potentially observable Ẹ in 
which (1) the probability of an observed X's falling in this region is at most 
equal to the level of significance when H, is true and (2) the probability of an 
A's falling in this region is relatively high when Hy is true, The latter con- 
dition is necessary to assure the power of the test. 

Probabilities associated with the sampling distribution of X when Н, is 
true are required in order to construct the decision rules, In addition, some 
knowledge about the relative location of the sampling distribution of Y 
when each of the possible alternative hypotheses is true is required. Con- 
sider Fig. 2.1. When Н, is true, the sampling distribution of Ẹ is given by 
(1). When Hyis true (that is, uis greater than 50), the sampling distribution 
of X will have an expected value somewhere to therightof 50. In particular, 
one possibility for this expected value is that и = 54. This possibility is 
represented by (2). Areas under these curves represent probabilities. The 
probability of observing an X in a range of values covered by the extreme 
right-hand tail of (1) is relatively low when Н, is true but relatively higher 
when the alternative hypothesis is true. 

Suppose that the experimenter wants to formulate a set of decision rules 
which, in the long run, will make the probability of an erroneous decision 
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when H; is true less than .01. This is another way of saying that the level of 
significance of the testis to be .01. Suppose that the mean of the potentially 
observable sample is designated by the symbol X,,,. Then the decision 
rules will take the following form: 


Reject Н, when Xops is greater than L. 
Do not reject H, otherwise. 


Lis the critical value for Y,,,; L must have the property that 

P(Xovs > L | Н, true) = .01. 
In words, the probability of drawing a sample whose mean is greater than L 
is to be .01 when Н, is true. 

Under the assumptions that have been made, when Н, is true the form and 
parameters of the sampling distribution for sample means are known to be 
N(50,2), that is, normal in form with expected value equal to 50 and standard 
error equal to 2. The tabulated values of the normal distribution are 
directly appropriate only for the standard normal, N(0,1). From the table 
of the standard normal, the probability of observing a value 2.33 standard- 
error units or more above the mean of a population is .01. For the dis- 


tribution N(50,2), 2.33 standard-error units above the mean would be 
50 + 2.33(2) = 54.66. Therefore, 


P(X > 54.66 | H, true) = .01. 


Thus the region of rejection for H, is X > 54.66. When Н is true, the 
probability that a random sample of size 25 from N(50,10) will have a mean 
larger than 54.66 is less than .01. When one of the alternative hypotheses 
is true, i.e., when p is greater than 50, the probability of a sample mean 
falling in this region will be higher than .01 ; the larger the difference between 
the true value of и and 50, the higher the probability of an observed mean 
falling in the region of rejection. 

The steps in the formulation of the decision rule have been as follows: 

p le population of measurements assumed to be normal in form, with 
o= 
4 2. Random sample of size п = 25 elements to be drawn from this popula- 
ion. 

3. Хов to be computed from sample data. 

, The hypothesis being tested, the alternative hypothesis, the level of 

significance of the test, and the decision rules are as follows: 


H0. 
Hy: и> 50, 
w= 01. 
Decision rules: Reject Н; when Yous > 54.66. 


Do not reject H, otherwise. 
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The region of rejection for H, may be represented geometrically as the right- 
hand tail of the sampling distribution for X which assumes H; to be true (see 
Fig. 2.2). 

There are many regions in which the probability is .01 for observing a 
sample mean. The level of significance of a test does not determine where 
the region of rejection is to be located. The choice of the extreme right- 
hand tail of the sampling distribution which assumes H, to be true was neces- 
sary in order to minimize type 2 error (or, equivalently, to maximize the 
power). In general, the alternative hypothesis determines the location of 
the region of rejection, whereas the level of significance determines the size 
of the region of rejection. In this case the alternative hypothesis does not 


N(50,2) 


54.66 


Region of rejection 
for Hy 


Figure 2.2 


include the possibility that is less than 50, М№о matter how much smaller 
than 54.66 the observed sample mean is, H; is not rejected. Thus, if H, is 
not rejected, the evidence would indicate that и is equal to or less than 50. 
On the other hand, if H, is rejected, the evidence would indicate that и is 
greater than 50. Locating the region of rejection for Н, in the right-hand 
tail provides maximum power with respect to the alternative hypothesis that 
ш 15 greater than 50. 

The power of these decision rules with respect to various alternative 
hypotheses is readily computed. For example, the power with respect to 
the alternative hypothesis  — 58 is represented geometrically by the shaded 
area under curve (2) in Fig. 2.3. This area represents the probability of an 
observed mean's being greater than 54.66 when the true sampling distribu- 
tion is №(58,2). With reference to the latter sampling distribution, the 
point 54.66, which determines the region of rejection, is (54.66 — 58.00)/2 
or 1.67 standard-error units below the mean. The area from the mean to 
1.67 standard-error units below the mean is .45. Hence the total shaded 
area іѕ .45 + .50 = .95. Thus the power of this test with respect to the 
alternative hypothesis и = 58is.95. Conversely, the probability of a type 
2 error when ш = 58 is .05. 
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Instead of working directly with the sampling distribution of the statistic 
¥ and formulating the decision rules in terms of the statistic X, it is more 
convenient to work with the statistic 


where / is the value specified by Hı. The sampling distribution of this z 


(1) (2) 


50 ™ 54.66 58 


Region of rejection for Н, 


Figure 2.3 


statistic is N(0,1). Given the mean of an observed sample, Хь», the corre- 
sponding value of the z statistic, when №, is и = 50 and o/ Мп — 2, is 
Xovs — 50 
m ann 


If the alternative hypothesis is Н»: ш > 50, then the decision rules for a test 
having level of significance .01 are as follows: 


Zobs — 


Reject Н, when 2оһ > 2.33. 
Do not reject Н, otherwise. 
The value 2.33 satisfies the condition that 
Р(го > 2.33 | Н, true) = .01. 


This numerical value actually is the 99th centile point on N(0,1) and will be 
designated by the symbol 2. Thus 2 = 2.33. Since the level of 
significance for this test is ® = .01, the critical value for the decision rule 
can be designated 2, ,, which in this case is 2 ә. 

For the general case in which the region of rejection for H; is the right- 
hand tail of Ж(0,1) and the level of significance is equal to some value 2, 
the decision rules take the following form: 


Reject Н, when zops >21-,. 
Do not reject H, otherwise. 
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Suppose that the mean for the sample observed actually is 60. Then 
the numerical value of the z statistic (when Н, is that и = 50 and с[Мп = 2) 
is 


Since Zons is larger than 2.33, Н, is rejected. Hence the observed data do 
not support the hypothesis that the population mean is 50. The data 
indicate that the mean in the population is greater than 50. 


N(0,1) 


Right-hand tail 
of region of 
rejection for H; 


Left-hand tail 
of region of 
rejection for Н, 


Figure 2.4 


If the alternative hypothesis had the form Н»: и 5 50, then the region of 
rejection for H, would have the form 


Zobs < Z,g and 20р > 24-(a/2)- 


For this kind of alternative hypothesis, the region of rejection for Н, in- 
cludes both the left-hand and right-hand extreme tails of the sampling 
distribution associated with H,. The two parts of the region of rejection 
for H, are sketched in Fig. 2.4. For example, if х = .01, the two-tailed 
region of rejection for H, would be z,, less than and 2 and л greater 
than 2. Locating the region of rejection for H, in this manner provides 
power with respect to the possibility that и is less than 50, as well as to the 
possibility that is greater than 50, An alternative hypothesis of this form 
is called a two-tailed alternative hypothesis, and tests which admit to the 
Possibility of a two-tailed alternative hypothesis are called two-tailed tests. 
The size of either tail of the region of rejection is equal to one-half the level 
of significance; the total size of the region of rejection is equal to the level 
of significance. 

In cases in which the experimenter is interested in rejecting Н, only when 
the alternative hypothesis is one having a specified direction with respect 
to Н,, a one-tailed rather than a two-tailed alternative hypothesis is the 
More appropriate. Limiting the region of rejection to one tail of the 
Sampling distribution for H, provides greater power with respect to an 
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alternative hypothesis in the direction of that tail. This fact is illustrated | 


geometrically in Fig. 2.5. Тһе power under a two-tailed test with respect 
to a specified alternative hypothesis to the right of zero is shown by the 
shaded area in part а. The corresponding power with respect to a one- 
tailed test is shown in part b. Although the magnitude of the type 1 error 
is the same in both cases, the increased power in the one-tailed case is at the 
expense of zero power with respect to alternative hypotheses which are to 


N(0,1) 


N(0,1) 


Figure 2.5 


the left of zero. In the latter case, all hypotheses corresponding to sam- 
pling distributions to the left of zero may be considered part of H,. 


2.2 Tests of Hypotheses on Means—o Estimated from Sample Data 


In Sec. 2.1, the value of the population standard deviation о was assumed 
to be known. Generally c is estimated by the sample standard deviation 
s. Testing the hypothesis considered in the last section when o is estimated 
by s involves no basic change in principles for arriving at a decision rule. 
The sampling distribution of the statistic that must be used, however, is no 
longer normalin form. The test statistic convenient for use in this case iS 


-Х-ш 


s/n ° 


t 
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where и is the value of ш specified by H,. When H; is true, the sampling 
distribution of this f statistic is Student's ż distribution having n — 1 
degrees of freedom. (The degrees of freedom for this sampling distribu- 
tion are determined by the degrees of freedom for s.) When H, is not 
true, the form of the sampling distribution for this t statistic is no longer 
approximated by Student’s ¢ distribution but rather by what is called a 
noncentral t distribution. 

For the one-tailed alternative hypothesis ш > ду, the region of rejec- 
tion for Н, is given by 

tops > fj (n — 1), 


where о is the level of significance and 1, „(п — 1) is the 1 — « percentile 
point on Student’s ¢ distribution having п — 1 degrees of freedom. To 
illustrate these procedures, suppose that the data observed in a random 
sample from the population of interest are 


п = 25, Х = 60, 5 = 15. 


Suppose that the statement of the hypothesis to be tested and the alter- 
native hypothesis are 


Hy: u= 50 
Но: 022 50 


and that the level of significance of the test is .01. From the table of the 
distribution of the ¢ statistic having 24 degrees of freedom one finds that 


P(toys > 2.49 | Н, true) = 01. 


That is, the table of the г distribution indicates that / (24) = 2.49. Hence 
the decision rules for this test are as follows: 


Reject H, when tops is larger than 2.49. 
Do not reject Н, otherwise. 
From the sample data, fons is found to be 
gn ae 
15/,/25 


Since f, is greater than the critical value 2.49, tops falls in the region of 
rejection for H,. Hence the decision rules indicate that H, should be 
rejected. ) 
The interpretation of this test is as follows: On the basis of the data in a 
random sample of size 25 from a population of interest, the hypothesis that 
I! = 50 cannot be considered tenable when the test is made at the .01 level 
of significance. If this hypothesis were true, the probability of obtaining 
the data in the sample would be less than .01. The data obtained support 
the hypothesis that the mean of the population is greater than 50. 
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Having rejected the hypothesis that и — 50, suppose that the experi- 
menter wanted to find the largest value for u, which would lead to non- 
rejection of H,. The region of nonrejection for Н, is defined by the 
inequality 

X — 4 
5//п 


Solving this inequality for , gives 


<t-,(n — 1). 


р 5 
у > X — -= ħ-a(n — 1). 
yn 


Thus any value of д, equal to or greater than Y — (s/ Уп), -a(n — 1) will 
yield a / statistic that will fall in the region of nonrejection for H,. For 
the numerical example just considered, any H, that specifies да to be equal 
to or greater than 


60 — (3.00)(2.49) = 52.53 


would, on the basis of the single sample observed, lead to a decision not to 
reject H,. Thus, on the evidence supplied by the single sample observed, 
any value for и, equal to or greater than 52.53 would make tops smaller 
than the critical value of 2.49. Therefore the experimenter may conclude 
that the population mean is likely to be greater than 52.53. If the experi- 
menter were to test hypotheses specifying that м is any value equal to or 
less than 52.53, the decision in every case (for the data in the given sample) 
would be to reject H}. This conclusion may be expressed in the form of a 


one-tailed confidence interval on the population mean. This confidence 
interval takes the general form 


с, >Х--Еі (һ- |- - 

ш ег 1-а(п — 1) 1—a. 

The numerical values in terms of the observed sample data and х = .01 are 
С[и > 52.53] = .9. 


The value 52.53 may be considered as the lower bound for u. If one were 
to draw additional samples, the mathematical form of the lower bound 
would remain the same but its numerical value would change, since the 
Once a sample has been 
е interval determined, the 
ertrueorfalse. However, 


the procedure by which the confidence interval is constructed will, in the 


long run, lead to statements which are correc 
I. 
The pone that has just been considered involved a one-tailed alter- 
даз 'ypothesis. Suppose that the experimenter is willing to reject M, 
Wi ien и is either smaller or larger than 50, In this case the alternative 


> 
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hypothesis takes the form и + 50. То provide power with respect to both 
tails of the alternative hypothesis, the decision rules for this case are as 
follows: 


Reject Н, when tovs | ^ taja(n — 1). 


> ti-a (n — 1). 
Otherwise do not reject Hy. 


The region of rejection for H, for the case of a two-tailed alternative hy- 
pothesis is sketched in Fig. 2.6. The size of the region of rejection in each 
tail is equal to 2/2. The left-hand tail ofthe region of rejection makes pro- 
vision for power with respect to alternative hypotheses и < 50; the right- 


t(n-1) 


Right-hand tail of 
region of rejection 
for H; 


Left-hand tail of 
region of rejection 
for H; 


t1- (v?) 


Figure 2.6 


hand tail of the region of rejection makes provision for power with respect 
to alternative hypotheses ш > 50. 
Forn — | = 24 and « = .01, 


taja = Logs = —2.80, 
and ti-am = Lgs = 2.80. 


(Since Student's t distribution is symmetrical, £j = —f 445.) For this 
case the decision rules are 


—2.80. 
Reject H, when tops ы 


> 42.80; 
Otherwise do not reject H}. 


For Хо, = 60 and sq, = 15, fons = 3.33. Hence the decision rules lead 
to rejecting H,. Having rejected the hypothesis that и = 50, the experi- 
menter may be interested in determining the range of values of и which, 
on the basis of the observed sample data, would not be rejected by these 
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decision rules. This range of values is defined by a two-tailed confidence 
interval on и, which is given by 


C[X—c xu x + с] 21— o, 


where = Е ti-u (n — 1). 

n 
The numerical value of this confidence interval for the observed sample and 
a = 0l is 


C[51.60 < u < 68.40] = .99. 


To illustrate the fact that any value of жіп this range which is selected for 
Н, leads to а decision not to reject H}, consider the hypothesis that и = 68. 
For this H, 


Tu 7 oo ey 


Since £j, is greater than —2.80 the hypothesis that и = 68 is not rejected. 

This relationship between confidence intervals and tests of hypotheses 
applies to many classes of tests. Given а set of statistics computed from a 
sample, and given a confidence interval of size 1 — «ona parameter, then 
the range of values within this interval will provide a range for the param- 
eter in H which will lead to a decision not to reject Н, at level of signifi- 
cance х. If H, specifies values of the parameter outside this range, the 
decision will be to reject H,. Two-tailed confidence intervals are associ- 
ated with two-tailed tests and one-tailed confidence intervals with one- 
tailed tests. Thus confidence intervals provide information about the 
potential outcomes of a series of individual tests. 

The power of tests associated with Student's distribution is more difficult 
to compute than the power of tests associated with the normal distribution. 
Since the sampling distribution of the / statistic when Н, is not true is a 
noncentral t distribution, computation of the power with respect to alter- 
native hypotheses requires tables of the noncentral t distribution. Illus- 


trations of the applications of this latter distribution are givenin Johnson 
and Welch (1940). 


23 Testing Hypotheses about the Difference between Two Means— 
Assuming Homogeneity of Variance 


One problem common to many fields of research may be cast in the 
following form: Which one of two procedures will produce the better 
results when used in a specified population? To provide information 
relevant for an answer, the experimenter may draw two samples from the 
specified population. His experiment might consist of following proce- 
dure A in one of the samples and procedure Bin the other sample. (These 
procedures will be referred to as treatments А and B.) The question, 
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which one of the two is the better, requires some criterion on which to base 
the answer. Several criteria may be relevant—treatment A may be better 
with respect to some of these criteria, treatment B better with respect to 
others. Techniques exist for the evaluation of several criteria simul- 
taneously, but in this section methods for evaluating a single criterion at a 
time will be considered. 

Suppose that the experimenter measures each of the elements in the two 
samples on a single criterion. The results may be that some of the scores 
under treatment A are higher than those under B, and vice versa. If the 
distribution of the scores within each of the samples is approximately 
normal in form, then a comparison of the means of the criterion scores 
provides one kind of information about which of the two treatments gives 
the better results. 

In some experimental situations, the variability on the criterion within 
the samples assigned to different treatments is primarily a function of (1) 
differences in the elements observed that existed before the start of the 
experiment—such differences are not directly related to the experimental 
treatment—and (2) uncontrolled sources of variability introduced during 
the course of the experiment which are in no way related to the treatment 
itself. In such cases one might reasonably expect that the criterion 
variance within each of the samples assigned to the experimental treatments 
is due to common sources of variance. In more technical language, one 
might reasonably expect homogeneity of variance, i.e., that the sources of 
variance within each of the samples are essentially the same and that the 
variances in the corresponding populations are equal. 

It is convenient to formalize the arguments just given in more mathe- 
matical terms. The formal mathematical argument will serve to make 
explicit what it is that one assumes to arrive at the conclusions that have 
just been reached. Let the criterion measure on element / in the sample 
given experimental treatment j be designated by the symbol X,. In this 
case there are two experimental treatments; so jstandsfor either treatment 
Aortreatment B. Suppose that this measurement may be expressed as the 
sum of a quantity ту, which represents the effect of experimental treatment 
} and a quantity ¢,,, which is not directly related to experimental treatment 
Jj. That is, suppose that 
(1) Xi = т; T е. 


The effect e; includes all the unique characteristics associated with the 
element i as well as all uncontrolled effects associated with the experi- 
mental conditions under which the measurement is made. The effect т, is 
assumed to be constant for all elements in the experimental group assigned 
to treatment j, whereas the effect ¢,; varies from element to element within 
the group and is in no direct way related to the experimental treatment. 
The effect е, is frequently called the experimental error. 
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The term Х,; on the left-hand side of (1) represents a quantity that is 
observed. The terms on the right-hand side of (1) cannot be observed— 
they designate the variables that account for what is observed. The terms 
т; and e,; represent structural variables underlying the observed data; 
(1)is referred to asa structural model. The first basic assumption that has 
been made about the variables in the structural model is that they are 
uncorrelated. (Being uncorrelated is a less stringent assumption than 
statistical independence. The latter assumption is, however, required in 
making tests. For purposes of estimation, only the assumption of zero 
correlation is required.) 

Let o; and оў designate the expected values of the criterion variance 
within the respective experimental groups. (That is, o2 represents the 
mean of the sampling distribution of the statistic s?, the variance on the 
criterion for a sample of elements given treatment 4.) Suppose that the 
experiment is designed in such a way that in the long run there will be no 
difference in the unique characteristics of the group of elements assigned 
to treatments А and 8. Suppose also that the experiment is conducted in 
such a manner that the uncontrolled sources of variability are comparable 
in the two experimental groups. These latter assumptions imply homo- 
geneity of experimental error; i.e., they imply that о> = о). These latter 
assumptions also imply that the expected value of the mean of the experi- 
mental error within treatment groups A and B will be equal. That is, if 
ғ, and ғ, represent the respective sample means of the experimental error 
within the treatment groups, the expected values of these quantities will be 
equal if the assumptions are true. The quantities ғ, and ғ, are the means 
of structural variables and cannot be computed directly from a single 
sample. 

Experience has shown that the model (1) and the assumptions made 
about the variables in the model are appropriate for a large class of experi- 
mental situations. The tests to be considered in this section are suitable 
for this class. In terms of this structural model, the mean of the criterion 
Scores for a sample of elements given treatment A may be represented as 


(2) X, — 7, + &. 
Since the effect 7, is assumed to be constant for all elements in the sample, 


its mean effect will be simply 7,. The corresponding mean for a sample of 
elements given treatment B may be represented by 


(3) X, — 7, 4- &. 
The difference between the two sample means has the form 
(4) X, — X, — (ta — ту) + Gq — &). 


In words, (4) says that the observed difference between the criterion mean 
for the sample 


: given. treatment А and the criterion mean for the sample 
given treatment B is in part a function of the difference in effectiveness of 


TESTING HYPOTHESES ABOUT THE DIFFERENCE BETWEEN TWO MEANS 27 


the two treatments and in part a function of the difference between the 
average experimental error associated with each of the means. One 
purpose of a statistical test in this context is to find out whether the ob- 
served difference is of a magnitude that may be considered a function of 
experimental error alone or whether the observed difference indicates 
some effect larger than that due to experimental error. 

The details in the analysis of an experiment designed to study the effects 
of treatments A and B on a specified criterion will now be considered. 
From the population of interest a random sample of л elements is drawn. 
The elements are subdivided at random into two subsamples—one of size 
n,, the other of size n,. (In most cases n, will be equal to n. Тһе most 
sensitive design makes л, and m, proportional to c; and оў}, respectively.) 
The sample of size n, is assigned to experimental treatment 4; the sample 
of size n, is assigned to experimental treatment B. After the administra- 
tion of the treatments, each of the elements in the experiment is measured 
on a common criterion. Suppose that experience in related research 
indicates that the distribution of such criterion measures tends to be 
approximately normal in the population of interest. Suppose that inspec- 
tion of the observed data does not contradict what past experimentation 
indicates about the form of the distribution of the criterion measures within 
each of the experimental conditions. 

To summarize the information from the experiment, the mean and 
standard deviation for each of the experimental groups are computed. 
The sample statistics Y, and 82 provide, respectively, estimates of the 
parameters u,and oè. Similarly for treatment group B, X, and s; provide, 
respectively, estimates of the parameters у and oj. A schematic outline 
of this experiment is given in Table 2.3-1. If the form of the distribution of 
the measurements were not approximately bell-shaped, other statistics 
might be more appropriate to summarize the information in the samples. 

If one were to repeat this experiment a large number of times, each time 
Starting with a different random sample, and if one were to compute the 
statistic Y, — Ж, for each experiment, the sampling distribution of this 
statistic would be found to be approximately normalinform. Theexpected 
value of the sampling distribution would be approximately xa — /», and the 
standard error would Бе V(o2/n,) + (02/7). Since ozis assumed to be equal 
to оў, let the common value of the population variance be designated by oi. 
In terms of o?, the standard error of the sampling distribution would have 
the form 


— deti i 
o Ох-х, "n г. ny 


The value of the common population variance 7; is not known. Given 52 
and s? computed from sample data, the best estimate of 0% is a weighted 
average of the sample variances, the weights being the respective degrees of 
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freedom. Designating this weighted average by the symbol 52, 
6 3a Ds; + (m = Dsi 

(n, — 1) + (m — 1) 
This weighted average of the sample variances is known as the pooled within- 
class estimate of the common population variance. The degrees of freedom 


Table 2.3-1 Outline of Steps in an Experiment 


Basic population 
of interest 
Random sample of 
Nat n, elements 


Treatment B 


Summary statistics: 
X, 


2 
зь 


Parameters of population 
after treatment A 


Ha 
сі 


Parameters of population 
after treatment B 


for 5j are the sum of the respective degrees of freedom for the parts; i.e., 
(n, — 1) + (1, — 1) =n, - m — 2. 
The statistic used to test hypotheses about My 


— his 
(1) tı Xo = X) — (и — и) 

Мап, + 1m) 
The sampling distribution of this statistic, under the assumptions that have 
been made, is the 1 distribution having n, + m, — 2 degrees of freedom. 
Operationally, if this experiment were to be repeated a large number of 


times, each time witha different random sample froma specified population, 
and if one actually knew the value of Ша — шь then the distribution of the 
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resulting г statistics could be approximated by the / distribution having 
п, + n, — 2 degrees of freedom. The latter degrees of freedom are those 
associated with 52. In general one does not know the value of p, — ju. 
Its value must be specified by Hi. 

The denominator of the statistic defined in (7) is often symbolized by 
5$, ло that is, 


Sr IN 
The 1 distribution is used to test hypotheses about и, — д. In terms of 
the right-hand side of the structural model in (4), the expected value of 
numerator of the statistic in (7) is Е(ғ, — 6) = 0, and the expected value of 
the denominator is о; ,. Hence, when the hypothecated value of u, — to 
is actually the true value of the difference between these parameters, the ¢ 
statistic provides a standardized measure of the difference in the average 
experimental error for the two experimental conditions. 4 
Suppose that the experimenter is interested in testing the following hy- 
pothesis: 
Hy: Ma — № = 5. 
Hg: Ha — № F Ô. 
Level of significance = «. 


The numerical value of ô is the smallest practically important difference of 
interest to the experimenter. (This value is often taken to be zero.) | Since 
the alternative hypothesis is two-tailed, a two-tailed region of rejection for 
Н, is required, The region of rejection for Н is defined by the two tails of 
the sampling distribution which assumes Hi to betrue. The decision rules 
are as follows. 


А < talna + my — 2). 
Reject H, when tons | эн + Mo 
> ата + My — 2). 


Do not reject Н; otherwise. 


The ż statistic will be numerically large when (1) H, is not true or (2) Н, is true 
but the difference between the mean experimental errors is unusually large 
relative to what is expected on the basis of the assumptions underlying the 
experimental design. The probability of rejecting H, when the latter con- 
tingency occurs is less than the level of significance of the test. The region 
of rejection for H, is sketched in Fig. 2.7. j 

To illustrate the use of the 7 statistic and its sampling distribution in 
making the test about the difference иа — /%, suppose that the following 
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data are obtained from an experiment designed and conducted under con- 
ditions such that the assumptions underlying the sampling distribution of 
the ¢ statistic are satisfied: 


aaaea 


Treatment Sample size Sample variance Criterion mean 
A па = 8 53 = 18 x, 20 
B m = 10 si = 12 Ж, = 25 


The experimenter is interested in making a two-tailed test on the hypothesis 
that и, — fy = 0. Since rejecting this hypothesis when it is true is con- 
sidered by the experimenter to be a more costly error than not rejecting this 
hypothesis when it is false, H, has the form и, — и, = 0. The level of 
significance for this test is chosen to be .05. The pooled estimate of the 


t(n,+n,—2) 


Left-hand tail of Right-hand tail of 
region of rejection region of rejection 
for H; for H; 


Figure 2.7 


common population variance will have 16 degrees of freedom. Hence the 
decision rules for this test will have the following form: 


= —2 12° 


PO 5012; 
Do not reject H, otherwise. 


Reject Н, when tons 


The value 2.12 is the .975 percentile point on the ¢ distribution having 16 


degrees of freedom, that is, /,..(16) = 2.12, Since the t distribution is 
symmetrical, t oə5(16) = —2.12. 


For these data, using (6), the estimate of the population variance is 


1 
s = X8) + 9012) = 302). 14.62, 


The value of ty» is given by 
top, — Q0—25 —0 5 5 
obs =ч; a 
V14.628 +75) 14.6288) 182 


= —2.75. 
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Since f, is less than the critical value —2.12, Н, is rejected. Thus the hy- 
pothesis that treatments A and B are equally effective with respect to the 
criterion measure is not supported by the experimental data. Inspection 
indicates that treatment B has the higher mean. Hence the data support the 
alternative hypothesis that ц, — дь < 0, that is, и, < дь. 

The hypothesis that the difference between the population means is zero 
having been rejected, the tenable values for this difference are given by a 
confidence interval for и, — иь. This interval has the general form 


CX, — X) — e <= m (Y X) c] = 1-4, 
where © = 1ү-(узу$у Ж, 


For the numerical data being considered, 
c = 2.12(1.82) = 3.86. 
Hence a .95 confidence interval on the difference between the two treatment 
means is 
C[—8.86 < д, — u, € —1.14] = 595. 
For these sample data, any Н, which specifies that и, — Mo is within this 
interval will lead to nonrejection of Н, when « = .05. 


2.4 Computational Formulas for the / Statistic 


For simplicity in computational work, the basic formula for the г statistic 
is not the most suitable for use. То avoid rounding error at intermediate 
stages of work, the best computational procedure puts off divisions and 
square roots until the final stages. For accurate computational work the 
general rule is to do all possible addition, subtraction, and multiplication 
before doing any division or taking any square root. To arrive at a com- 
putational formula for the 2 statistic, the following notation will be used: 


XX, = sum of the n, observations in experimental group A. 
=X, = sum of the n, observations in experimental group B. 
EX = XX, + EY, = sum of all observations in the experiment. 
ЖАЎ = sum of the squares of the observations in experimental group A. 
ХХ) = sum of the squares of the observations in experimental group B. 
=X? = XX 24 EX? = sum of the squares of all observations in the experiment. 


The definition of the symbol L, is 


(1) La = EX; — (2X). 
An analogous definition of the symbol L, is 
(2) L, = nEXi — 9X. 


In terms of this notation, a computational formula for the square of the ¢ 
Statistic, when Н, has the form м, — дь = 9, is 

Q) р (a E m = 2X, — MEX 

Б (ng + Ny)(NpLg + MaLo) 
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Taking the square root of г? yields the / statistic. Тһе algebraic sign is given 
by the sign of Y, — Х,. 


In terms of L, and L, computational formulas for variances are 


(а) gan 
n(n, — 1) 
L 
5 $= 
5 Ы n(n, — 1) 


A computational formula for 55, Which minimizes rounding error at inter- 
mediate stages, is 


(6) 52 nan, EX? — n(XX,y — n EX, 


» 


пап (п, + ny — 2) 


There is one disadvantage to these formulas: the numbers involved at inter- 
mediate steps tend to become quite large even for relatively small samples. 
Use of these formulas is illustrated in the numerical example given in Table 
2,4-1. 

Table 2.4-1 Numerical Example of Computation of ¢ Statistic 
SE SE a eres ae 


Treatment A Treatment B 
3 6 
5 5 
n, = 7 ^ 1 п, = 10 
6 9 
2 4 
7 7 
8 
9 
7 
ХХ, = 29 Жуз = 70 
Уха = 143 EX? = 514 
La = 7043) - (29) = 160 1, = 10(514) — (70)? = 240 
160 240 
58 = = 3.81 $ жас = 67 
4 ТӨ % = 1009) 2; 
А =} = 414 ¥, = 19 = 700 


p (а + п, — 2)(п®Х„ — ngEX,)? 
(па + т), + nL) 
.. (15)[10(29) — 7(70)? T 600,000 10.760 
(17)10(160) + 7(240)] 55,760 $ 


Е.(1,15) = 4.54 
t = —V10.760 = —328t tls) = —2.13 


T Negative sign is used because X, — X, is negative. 
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The decision rules for Hy:/4. — рь = 0 against a two-tailed alternative 
hypothesis may be stated in terms of either £j, or ih. In terms of tons the 
decision rules are (assuming that « = .05) 


Reject Н; if tovs BSE 


5542218; 
Otherwise do not reject Hy. 


Since һы = —3.28, Н, is rejected. Inspection of the data indicates that 
treatment B gives the greater mean criterion score. In terms of /2,,, the 
decision rules are 

Reject Н, if tans > 4.54. 


Otherwise do not reject H4. 


In this case /2,, = 10.76; hence H; is rejected. Тһе critical value for the ? 
statistic is F,_,(1, n, + n — 2) or equivalently Beta --" — 2). 


2.5 Test for Homogeneity of Variance 


The test on population means developed in Sec. 2.3 was based upon a 
structural model which assumed that c? = oj. In the absence of extensive 
information from past experimentation in an area, the data obtained in the 
experiment are sometimes used to make preliminary tests on the model. 
Preliminary tests on structural models do not establish the appropriateness 
of the models; rather their appropriateness depends upon the design of the 
experiment and the nature of the sources of variation. The purpose of pre- 
liminary tests is to provide a partial check on whether or not the observed 
data tend to be consistent with the model. The observed data may actually 
be consistent with several models. { і 

Moderate departures from the hypothesis that о2 = о) do not seriously 
affect the accuracy of the decisions reached by means of the t test given in 
Sec. 2.3. In more technical language, the / test is robust with respect to 
moderate departures from the hypothesis of homogeneity ofvariance. An 
extensive investigation of the effect of unequal variances upon the / test 
and the corresponding F test is found in the work of Box ( 1954). Тһе term 
moderate in this context is relative to the magnitude and difference in sample 
sizes. To illustrate the effect of unequal population variances upon the 
accuracy of the decision rule based upon the ¢ test, which assumes equal 
population variances, consider the case їп which o? = 20. If n, = 5 and 


ль = 5, then the 95th percentile point on the F distribution which assumes 


the population variances equal is approximately equal to the 94th percentile 
h actually takes into account the 


point on the sampling distribution whic Ч 
difference in the population variances. Thus for an .05-level test with 
these sample sizes, this violation of the assumption that o; = 6) results 
in an error in the level of significance of approximately 1 per cent (in the 
direction of rejecting H, more often than should be the case). 
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The work of Box (1954) also indicates that the / test is robust with respect 
to the assumption of normality of the distributions within the treatment 
populations. That is, the type 1 error of the decision rule is not seriously 
affected when the population distributions deviate from normality. Even 
when population distributions are markedly skewed, the sampling distribu- 
tion of the / statistic, which assumes normality, provides a good approxima- 
tion to theexact sampling distribution which takes into account the skewness, 
In summary, preliminary tests on the structural model for the ¢ test, which 


F(n,-1, n,—1) 


Figure 2.8 


assumes homogeneity of variance and normality of distributions, are not of 
primary importance with respect to type 1 error, particularly preliminary 
tests for normality of distribution. 

The test of the hypothesis that o; = o7 is useful in its own right, quite apart 
from its use in a preliminary test. Suppose that the following data are 
potentially available for random samples given treatment A and treatment В: 

к DL D PO ы ы са эы. 
Treatment | Sample size | Sample variance 


ites 25735: sten cro 
B he | К 


Assuming that the distribution of the measurements in the treatment popula- 
Uonsis approximately normal, under the hypothesis that o? = o? the statistic 
2 
Sy 
has a sampling distribution which is approximated by F(n, — 1, m — 1), 
that is, an F distribution with т — 1 degrees of freedom for the numerator 
and ль — 1 degrees of freedom for the denominator. Operationally, if the 
experiment from which the potential data given above were to be repeated a 
large number of times, and if for each experiment an F statistic were com- 
puted, then the resulting distribution of the F statistics would be approxi- 
mately F(n, — 1,m — 1). The region of rejection for the hypothesis 02 = % 


TEST FOR HOMOGENEITY OF VARIANCE 35 


against the alternative hypothesis o; A oj is sketched in Fig. 2.8. The 
F distribution is not symmetrical, but there is a relationship by means of 
which percentile points in the left tail may be obtained from points in the 
right tail. This relationship is 


(1) Еп, — 1, m — 1) : 


Fi-a — 1, ny — 1) 

Most tables of the F distribution provide values only for the right-hand 
tail. Values for the left-hand tail are readily computed by means of the 
relation (1). There is, however, a procedure by which use of the left-hand 
tail may be avoided. It is actually immaterial which treatment is called A 
and which B. Suppose that К, is defined to be the ratio of the larger 
sample variance to the smaller sample variance. Hence Fons will always 
tend to fall toward the right-hand tail of the F distribution for which the 
degrees of freedom for the numerator are equal to the degrees of freedom 
for the larger sample variance and the degrees of freedom for the denomina- 
tor are equal to the degrees of freedom for the smaller sample variance. 

To illustrate this test, suppose that the following data were obtained in an 
experiment: 


Treatment | Samplesize | Sample variance 


A n, = 10 
В nm = 8 


To test the hypothesis o? = о) against the alternative hypothesis 0% A 9), 
chosen to be« = .10, then the critical 


= 3.29. Since Fyns does not 
is not contradicted by the 


Ifthe level of significance for this test is 
value for the region of rejection is F5;(7,9) 
exceed this value, the hypothesis that o; = [71 
Observed data. 

If one draws a large number of rand 
mately normal population (14,97); 
statistic 


om samples of size i, from an approxi- 
then the sampling distribution of the 
n, — 10% 


dU Pulp ЖЕН 
Ха аа 
is approximated by the chi-square distribution having n, — 1 degrees of 
freedom, If one were to draw similar samples from the population which 
is approximately М(/,02), then the sampling distribution of the statistic 
За (ny, — Ds, 
2% E 
atistic having, — 1 degrees of freedom. 


is approximated by the chi-square st І 
have no elements іп common, the two 


Assuming that the two populations 
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chi-square statistics will be independent. In an actual experiment there 
will be one sample from the A population and one sample from the B 
population. 
The statistic 

Q) Ez zin, = si 

zl, ed 64/0), 
has a sampling distribution given by F(m, — 1,m — 1). This statistic may 
be used to test any hypothesis about the relationship between о? and оў. 
Under the hypothesis that 02 = оў, the ratio o2/o? = 1.00, and (2) assumes 
the form F = 52/52. The ratio of two statistics whose sampling distributions 
are independent chi squares will always have a sampling distribution given 
by an F distribution. 


2.6 Testing Hypotheses about the Difference between Two Means— 
Assuming That Population Variances Are Not Equal 


Suppose that independent random samples of size n, and m, are drawn 
from a common population and are assigned, respectively, to treatment 
conditions А and B. The elements are then measured оп а common cri- 
terion. The following data may be obtained from this type of experiment: 


р ee ee 


Treatment | Sample size Sample mean Sample variance 
A LA X, ха 
В ny X, % 


SS еі ы ЗАНЕ) s 


The sample data for treatment 4 provide estimates of the parameters of a 
population A, a hypothetical population which consists of the collection 
of elements after the administration of treatment 4. Thus Y, is an estimate 
of Ha aNd Sa isanestimateofo?. Ifthedistribution of the criterion measures 
1s approximately normal in form, the population А may be designated as 
approximately N(,,02). The data from sample B are assumed to provide 
estimates of the parameters ofa population which is approximately №(и,0%). 
Thus populations 4 and B are potential populations that could arise from 
the basic common population from which the samples were drawn. 

n Бес 2.3, tests about Ma — шь Were made under the assumption that 
9, = б). In this section, tests about Ha — uy make по assumption about 


the equality of the population variances. The sampling distribution of the 
Statistic 


a) z=% №) — (и. — n) 
V(@3/n,) + (о/п,) 
15 М(0,1). Twosets of parameters are involved in this statistic, Hypotheses 


about - y, cannot be tested unless values for оў and оў are known. 
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n, and ль are both large (say, larger than 30), then s; and 5; may be sub- 
stituted for the corresponding parameters and the resulting statistic has a 
sampling distribution approximated by N(0,1). For small samples, how- 
ever, the statistic 
(1a) н бз Xy) — Ha — №) 
Уб) + (уп) 


has a sampling distribution which is neither the normal distribution nor 
Student’s ¢ distribution. 

Several workers have attempted to obtain the exact sampling distribution 
of the /' statistic. The end products have differed slightly because of dif- 
fering assumptions that weremadein the course of thederivations. Behrens 
made one attempt to obtain the sampling distribution of the /' statistic; 
Fisher enlarged upon the work of Behrens, and the resulting distribution 
is called the Behrens-Fisher distribution. Tables of the Behrens-Fisher 
distribution are given in Fisher and Yates (1953, p. 52). A method of 
approximating critical values of the Behrens-Fisher distribution by Student’s 
t distribution is given by Cochran and Cox (1957, p. 101). 1 

Not all mathematical statisticians agree with the logic underlying the 
derivation of the Behrens-Fisher distribution. Using a somewhat different 
approach to a related problem, Satterthwaite (1946) derived an approxima- 
tion for the г’ distribution which involves Student's distribution having 
degrees of freedom approximated by the quantity 


U? 
2 = ———á 
© f [V3/(n, — 1)] + [Wm — 1)] 
where у-%, 29, and U=V+W. 
n, n, 


The critical values for the r’ statistic are obtained from tables of Student's / 

distribution having degrees of freedom equal to the nearest integer to f. 

A slightly different (and perhaps closer) approximation for f is given by 
U? 

3 = 13 Ббы 

E f (Vn, + 0] + (Wim + 0] 

Welch (1947) has derived an exact sampling distribution for the f’ statistic. 
Tables for this distribution have been prepared by Aspin (1949). Welch 
has shown that his distribution for the // statistic may also be approximated 
by Student's t distribution, the appropriate t distribution having degrees of 
freedom 
(4) f Л 


© fy? + fall — © Hy 


where ,-,,-, f-m-l ай с=с Gay’ 
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The value of fin (4) will be less than f, +f, but greater than the smaller of 
f.andf, If H, is rejected using the degrees of freedom equal to the smaller 
of f, and fù, the value of fneed not be computed, since the decision reached by 
its use will be to reject Н. The Welch approximation to the sampling dis- 
tribution of the /' statistic is from many points of view the soundest. Its use 
is illustrated in Table 2.6-1. 


Table 2.6-1 Numerical Example 


Treatments 
A B 
Sample size n, = 16 | m= 8 
Sample mean ¥, = 30.00 X, = 21.00 
Sample variance 52 = 32.00 | 5? = 80.00 
эы >л лг лы с... 


Нуда — My = 
30.00 — 21.00) — 
Agila — йу #0 Ма ао 2100) 310 0.00 21.00) = 0 = T = 2.60 
m V (32.00/16) + (80.00/8) v12 
d=; 


Кеш ы сне di 
- 32.00/16 fave T (15)7) 

(32.00/16) + (80.00/8) ` iz 7(.167)* + 15(.833)? 

= 9.90 

с — ——— ISIE ТИНИ 

To test the hypothesis that и, — д, = 0 against a two-tailed alternative 
hypothesis, the decision rules for a test with level of significance equal to 
.05 are as follows: 


€ 


Reject Н, if tps | ^ 10:07). 
> 19507). 
Otherwise do not reject Hj. 


The nearest integer to the numerical value of f computed in Table 2.6-1 is 10. 
From tables of Student's / distribution, /,,,(10) = 2.23. The value of 
ды = 2.60 is greater than 2.23; hence the hypothesis that и, — д, = 0 is 
rejected. In this case it is not actually necessary to compute the value of f. 
The degrees of freedom for the variance associated with the smaller sample is 
7; hence the numerical value of f will be largerthan7. Using г 9;,(7) = 2.36 
as a critical value leads to the decision to reject Ну. This critical value is 
larger than that based upon f. Hence Н, will also be rejected when the 
prope number of degrees of freedom is used in determining the critical - 
value. 

_ The critical value obtained from tables of the Behrens-Fisher distribution 
is 2.20. The use of the Cochran and Cox approximation to the Behrens- 
Fisher test will also be illustrated for the data in Table 2.6-1. The critical 
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value for 15, is given by 
Wala + Woly 


(5) teritical = 
Wa + Wy 


where w, = s;/n;, Wy = 5у/ль, ta = ова — 1), and fj = fi. (um — 1). 
For the data in Table 2.6-6 when х = .05, w, = 2.00, w, = 10.00, f, = 2.13, 
ty = 2.36, and 


2.00(2.13) + 10.00(2.36) ,,, 
12.00 »d 


teritical 


In general бы computed in this manner will differ only slightly from the 
critical value obtained by means of the Welch approximation. 

Formula (3) is actually a modified version of (4). Using (3) to compute 
J for the numerical data in Table 2.6-1 gives 


у= —— 20200 fio poni mer "qoos 
[(2.00)°/17] + [(10.00)2/9] 11.35 


Use of formula (4) gave 9.90. Use of (4) is to be preferred to (3). 


27 Testing Hypotheses about the Difference between Two Means— 
Correlated Observations 


In Secs. 2.3 and 2.6 the data from two independent samples were used as 
the basic data for tests. Elements in one of the samples were independent of 
(in no way related to) elements in the other sample. Hence the data in the 
two samples were uncorrelated. If the elements in single sample of size п 
are observed under both treatments 4 and B, then the resulting data are 
generally correlated. In this case two measurements are made on each ele- 
ment; hence the term repeated measurements to describe this type of design. 
If the order in which the treatments are administered has no effect upon the 
final outcome, then the difference between the two measures on the same 
element on a common criterion provides a measure of the relative effective- 
ness of the treatments. This measure is free of variation due to unique but 
systematic effects associated with the elements themselves. In this respect, 
each element serves as its own control. 

The following structural model, which describes the sources of variation 
that underlie an observation, is applicable in a variety of experimental 
Situations, 

(1) Хы = Ta F Ti + io 


where X,, = criterion measure for element i under treatment А; 

= assumed magnitude of effect of treatment A; 

= unique systematic effect associated with element /; 

= uncontrolled sources of variance affecting observation X;, 


(the experimental error). 


Та 


т 
€, 


i 


ia 
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Each of the structural variables on the right-hand side of (1) is assumed to be 
independent of the others. The corresponding structural model for an 
observation on element / under treatment В has the form 


(2) Xio = 7, + T; + ё. 

In Sec. 2.3, the structural model for an observation combined the term 
7, With the experimental error, since т, could not be distinguished from 
experimental error. In experimental designs having repeated measures, 


this source of variance may be eliminated from experimental error. The 


difference between two measures on the same criterion made ona randomly 
selected element i is 


(3) d, = Xa — Xn = (ra — ть) + (Eai — гь). 
This difference does not involve the variable v, The term т, — т is 
assumed to be constant for all elements, whereas the term Eai — €, Varies 


from element to element. Hence all variance of the d/'s is a function of 


Eai — ғы, Which is assumed to be a measure of experimental error. The 
mean of the differences has the structural form 


(4) d= X, — X, = (Ta — т) + (&, — &). 


Thus d consists of the sum of two components. One ofthe components is 
à measure of the difference between the treatment effects; the other is a 
measure of the difference between the average experimentalerror associated 
with each of the treatments. 

If the experimental error for the observations made under treatment A 
differs in no systematic way from the experimental error for the observations 
made under treatment B, then it is reasonable to assume that 


(5) E(, — &) — 0. 
"Therefore, 
(6) E(d) ^ т, — Ty = Ma — My 


In words, d provides an unbiased estimate of Ha — m. Ifthe distribution of 


the d’s is approximately normal with variance equal to 02, the expected value 


of the variance ofthe term Eai — & in the structural model, then the sampling 


distribution of the statistic 
(7) 4 (>ш-ш) 

Vsaln 
may be approximated by Student’s ¢ distribution havi 
freedom, where n is the number of differences. 
mate of о, 

The application of this sam 
Ha — {a Will be illustrated by 
represent the measures on a c 
tration of a treatment. 


ng n — 1 degrees of 
In (7), s? is used as an esti- 


pling distribution to testing hypotheses about 
means of the data in Table 2.7-1. These data 
ommon criterion before and after the adminis- 
In this type of experiment the question of which 
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treatment is to be administered first is not relevant. For a two-tailed test 
at the .05 level of significance of the hypothesis и, — дь = 0, the decision 
tules are as follows: 

< (6) = —2.45. 


>ішы(6) = 2.45. 
Do not reject Н, otherwise. 


The value of /,,, = 3.61 falls in the region of rejection for H,. Inspection 
of the data indicates that the difference in the population means is probably 
greater than zero. 


Reject H; if tons 


Table 2.7-1 Numerical Example 


Person number | Before treatment | After treatment Difference 

1 3 6 3 

2 8 14 6 

3 4 8 4 

4 6 4 -2 

5 9 16 7 

6 2 d 5 

7 | 12 19 xd 
Xd = 30 4-429 
Xd* — 188 


mee | o ка 0o eee 
La = n Xd? — (5а)? 
= 7(188) — (30)? = 416 
2 La 416 _ ооо Уа = V9.90/7 = 1.19 


UL 
Eu coo ©су 7 р рсе eB o 


Hyg — и = 0 


Hz: ha — My #0 tops = 119 


а = .05 
eee ee) ee SS =. 


A confidence interval for the difference between the population means 
takes the following general form, 
СХ-с<ш-ы Х+о)= 1—«, 
where C= h-am (n — ӘУЕС 
The symbol 7, (n — 1) is to be interpreted as the [1 — («/2)]-percentile 


point on the / distribution with л — 1 degrees of freedom, not as л — 1 times 
this value. For the data in Table 2.7-1, 


с = 2.45(1.19) = 2.92. 
The confidence interval is 
Cai Sla Ste) = 95. 
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Hypotheses which specify that the difference in the population means lies in 
this range will lead to a decision not to reject H, when the data in Table 2.7-1 
are used to make the test. 

If a single element is to be given two different treatments for the purpose 
of evaluating the difference in the effectiveness, there is the question of which 
treatment is to be given first. It may make some difference in the results if 
the treatments are administered in the order A followed by B or B followed 
by B. That is, the carry-over effect, if any, from the sequence A followed 
by B, or vice versa, may be different. This kind of effect is known as a 
treatment by order interaction. То check upon the possibility of this inter- 
action in a repeated-measures design, half the elements in the study may be 
given one sequence, the other half the reverse sequence. This design will 
permit the evaluation of order effects as well as treatment by order inter- 
actions, Tests associated with the evaluation of interaction effects are 
discussed in Chap. 7. However, if no interaction of this type exists, the f 
statistic in (7) may be used to test the difference in treatment effects when the 
order of administration of the treatments is counterbalanced. In the 
absence of any information about this interaction, the method of attack in 
Chap. 7 is to be preferred to the use of the г statistic in (7). 

The variance % іп the denominator of (7) may be expressed in terms of the 
variances of the individual measures, s? and sè- This relationship is 

бізді +s? —2rasis, 


(8) 


= var, + var, — 2 coy, 


abs 


where гь is the product-moment correlation between the measures, and 
where соул, is the covariance between the measures. The following rela- 
tionship holds: 


COVan = Гар545)- 


Mel notations var, and var, are sometimes used interchangeably with sj 
and sj. 


The computation of s; by means of this formula is illustrated by the 
numerical data in Table2.7-1. А summary of the computational steps for 
the statistics needed is given in Table 2.72, Substituting the numerical 
values in Table 2.7-2 for the corresponding statistics in (8) gives 


5а = 32.62 + 12.90 — 2(.868)(5.71)(3.59) = 9.91. 


Within rounding error, this is the value 
the difference measures. 


One of the primary advantages of repeated measures is the potential 
reduction in variance due to experimental error. For this design 


(9) E 


obtained by working directly with 


2 2 2 

8, Sa 8; 2 

4 " b T abSas 
gaw 


n n n n 
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The corresponding estimate for the case of uncorrelated observations is 


(10) ue ea 

ba Soe Шу 

If the correlation is positive, and it will be positive if the model in (1) holds, 
then the estimate of the experimental error obtained from a design in which 
(9) is appropriate will be smaller than that obtained from a design involving 
uncorrelated observations by a factor of 2r,5,5,. However, the degrees of 
freedom for the estimate in (9) aren — 1, whereas the degrees of freedom for 
the estimate in (10) are 2n — 2. Before a repeated-measures design may be 


Table 2.7-2 Numerical Example (Continued) 
Meson io) pS л --------- 


Before treatment (B) After treatment (A) Product of treatments 
ХХ, = 44 IX, =74 
ХХ) = 354 IX? = 978 ХХХ, = 572 
Ly = п®Хў — X La =n X2 — (2X)? |La = nEXQX,— (®Х„)(®Х,) 
= 7(354) — (44)? = 7(978) — (74)? = 7(572) — (74)(44) 
= 542 = 1370 = 748 
p= yd = гъ = —Le. 
5b n — 1) San — 1) RTT EI 
z 748 
= 12.90 = 32.62 7137545 
5, = 3.59 Sq = 5.71 = .868 


ee 


considered more efficient than a design which does not have repeated meas- 
ures, the decrease in the experimental error must be sufficient to offset the 
reduction in degrees of freedom. ‹ А t 
In areas of research in which the variance associated with the term ту in 
the model in (2) is likely to be large, the control on this source of variation 
provided by a repeated-measures design will greatly increase the sensitivity 
of the experiment. The smaller the proportion of the total variance in an 
experiment due to effects which are not related to the treatments per se, the 


More sensitive the experiment. 


2.8 Combining Several Independent Tests on the Same Hypothesis 


A series of experiments designed to test the same hypothesis may be con- 
ducted at different places or at different times. Suppose that the samples 
used in each of these experiments are independent; i.e., different randan 
samples are employed in each of the experiments. Suppose that the prob- 
abilities of the observed outcomes with respect to the common hypothesis 
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tested are designated P4, Р,....Р, If the experimenter desires to make 
an over-all probability statement, the following statistic may be used: 


(1) y = 25и, where u; = —ln P,. 


Under the hypothesis that the observed probabilities are a random sample 
from a population of probabilities having a mean of .50, the 7? statistic іп (1) 
has a sampling distribution which is approximated by the chi-square dis- 
tribution having 2k degrees of freedom. 

Use of this statistic is illustrated by the data in Table 2.8-1. These data 


Table 2.8-1 Numerical Example 


Experiment | m | Probability —In (probability) 


| .87 20 1.61 T 


1 

2 .54 .30 1.20 

3 1.10 15 1.90 

4 1.50 07 2.66 

5 1.30 11 2.21 
8 7” 


x? = 2(9.58) = 19.16 
х%5(10) = 18.3; x%7s(10) = 20.5 


t df = 14. 


give t statistics obtained in a series of five independent experiments testing 
a specified hypothesis against a one-tailed alternative hypothesis. The 
probabilities of obtaining the observed t statistic or one more extreme in the 
direction of the alternative hypothesis is given in the column headed Proba- 
bility. The natural logarithm of the probability with the sign changed is 
given in the next column. The numerical value for the chi-square statistic, 
19.16, is twice the sum of this last column. k, the number of experiments, 
is 5; hence the degrees of freedom for this chi-square statistic are 2(5) = 10. 
Since the probability of obtaining a chi-square statistic of 19.16 or larger is 
less than .05 when the mean probability in the population is .50, this latter 
hypothesis is rejected (if « = .05). Thus the combined evidence from the 
series of experiments indicates that the common Н, in each of the experi- 
ments should be rejected if inferences are to be made with respect to the com- 
bined populations represented by the series of experiments. 

There is an alternative approach to combining the outcomes of several 
experiments involving the use of the / statistic based upon relatively large 


samples. Under the hypothesis that the mean value for the / statistic in the 
population is zero, the statistic 
о) т Zi 


Jk 
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has a sampling distribution which is N(0,1). For the data in Table 2.8-1, 
87 + .54 + 1.10 + 1.50 + 1.3 

V5 


The probability of obtaining this value of z or one larger when the mean of t 
inthe population is zero is less than.01. Hencethecombined evidence from 
the five experiments indicates that the Н, common to each of the experi- 
ments should be rejected if the scope of the inference is the combined popula- 
tions and if x = .05. The sampling distribution of the statistic in (2) is 
approximately normal in form only when the 1;5 are approximately nor- 
mally distributed; hence in practice use of the statistic should be limited to 
cases in which the degrees of freedom for the г; are larger than 30. 

In some instances one could combine the data from all five experiments 
into a single larger experiment. This procedure might, however, tend to 
obscure rather than clarify the interpretation—the former would be the case 
if differences associated with the unique conditions of the individual experi- 
ments were allowed to augment the experimental error. This contingency 
on be avoided through use of designs which will be discussed in the next 
chapter. 


237. 


CHAPTER 3 


Design and Analysis of 
Single-factor Experiments 


3.1 Introduction 


In testing statistical hypotheses or in setting confidence bounds on esti- 
mates, one uses sampling distributions determined by purely mathematical 
considerations. One postulates a model, imposes various conditions upon 
this model, and then derives the consequences in terms of sampling distribu- 
tions which are valid for the mathematical system. To the extent that the 
model and the conditions imposed upon it approximate an actual experi- 
ment, the model can be used as a guide in drawing inferences from the data. 

If use is to be made of existing, well-developed mathematical models, 
experiments must be designed in a way that makes these models appropriate. 
If an experiment cannot meet the specifications in existing models, the 
experimenter may be able to develop a model tailored to the specific needs of 
his experiment. But the problem of the analysis of the resulting data must 
still be solved. If the mathematics needed to derive sampling distributions 
having known characteristics is manageable, the specially tailored model 
leads to inferences of known precision. Otherwise inferences drawn from 
an experiment of this kind will have unknown precision. 

The analysis of data obtained from an experiment is dependent upon its 
design and the sampling distributions appropriate for the underlying popula- 
tion distributions. The design, in part, determines what the sampling dis- 
tributions will be. Associated with standard designs are analyses justified 
by the mathematical models which led to the construction of these designs. 
Alternative designs are often available for an experiment having stated 
objectives. Depending upon the specificsituations, one design may be more 
efficient than another for the same amount of experimental effort—more 
efficient in terms of the power of the resulting tests and the narrowness of the 
resulting confidence intervals. A major problem in planning an experiment 
is to find or develop that design which is most efficient per unit of experi- 
mental effort in reaching the primary objectives of the experiment. 

46 
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The most efficient design from a purely mathematical point of view may 
be so costly in terms of time, money, and effort as to render it unworkable. 
In general, the smaller the variation due to experimental error, other factors 
being constant, the more efficient the design. This source of variation may 
be reduced by introducing various kinds of controls or by increasing the 
sample size. Both methods of reducing experimental error may be used, 
but which one provides the greater reduction per unit of cost depends upon 
features unique to the experimental conditions. 

The study of the statistical aspects of experimental design will assist the 
experimenter in finding the most adequate and feasible model for his experi- 
ment. This model must permit the experimenter to reach decisions with 
respect to all the objectives of his experiment. Whether or not a model 
adequately represents the experimental situation calls for expert knowledge 
of the subject-matter field. In appraising the adequacy of alternative 
models, the experimenter is often made aware of sources of variation and 
possible implications that he had not thoroughly considered. Thus design 
problems often force the experimenter to formulate his experiment in terms 
of variables that will lead to clear-cut interpretations for the end product. 

Some of the criteria for good experimental designs are as follows: 

1. The analyses resulting from the design should provide unambiguous 
information on the primary objectives of the experiment. In particular the 
design should lead to unbiased estimates. ; 

2. The model and its underlying assumptions should be appropriate for 
the experimental material. ў 

3. The design should provide maximum information with respect to the 
major objectives of the experiment per minimum amount of experimental 
effort. 

4. The design should provide some information with respect to all the 
objectives of the experiment. > “a А 

5. The design must be feasible within the working conditions that exist 
for the experimenter. , Д 

The designs that will be considered in this chapter are appropriate for 
what have been called single-factor experiments. The primary objective of 
this kind of experiment is to compare the relative effectiveness of two or more 
treatments оп а commoncriterion. The term single-factor in this context is 
used in contrast to the term multifactor. In this latter type of experiment 
the primary objective is tocompare the effect of combinations of treatments 
acting simultaneously on each of the elements. There are some instances 
in which the distinction between single-factor and multifactor experiments 1s 
difficult to make. : 2 4 

In this chapter only designs involving independent observations will be 
discussed ; corresponding designs having correlated observations are con- 
sidered in the next chapter. The designs in this chapter forma special case 
of what are called completely randomized designs. These form the building 
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blocks for many other designs; they also provide a standard against which 
the efficiency of other types of designs is measured. 


3.2 Definitions and Numerical Example 


A numerical example will illustrate the definitions of terms used in the 
analysis of a single-factor experiment. The actual analysis of this example. 
willbegivenindetail. Therationale justifying thisanalysis will be discussed 
in the next section. 


Table 3.2-1 
Method 1 Method 2 Method 3 
3 4 6 
5 4 7 n8 
2 3 8 
(i) 4 8 6 k=3 
8 7 7 
4 4 9 
3 2 10 
9 5 9 
бу T, = 38 Т, = 37 T, = 62 С = ХТ, = 137 
ХХ = 224 EX? = 199 EX? = 496 X(EXxj) = 919 
У Т? 2 2 
55-ХХ)-2і ss,-2x37-4 55,-ХХ)-14 — ss = 58, 
n 
(іш) 38° 37% 62° 
= 224 – 2 = 199 —— = 496 — = 
8 8 496 8 
= 43.50 = 27.88 = 15.50 SS, = 86.88 
—— es 
s) Т, = Туп Т, = Tin T; = Туп G = G[nk 
= 38/8 = 4.75 = 37/8 = 4.62 = 62/8 = 7.75 = 137/24 = 5.71 
T The symbol Т; is an abbreviation for the more complete notation given by 
T, = ХХа. 
i 
Similarly, =X? = DX}, 
%. 
і 


An experimenter is interested in evaluating the effectiveness of three 
methods of teaching a given course. A group of 24 subjects is available to 
the experimenter. This group is considered by the experimenter to be the 
equivalent of a random sample from the population of interest. Three 
subgroups of 8 subjects each are formed at random; the subgroups are then 
taught by one of the three methods. Upon completion of the course, each 
of the subgroups is given a common test covering the material in the course. 
The resulting test scores are given in part i of Table 32-1. The symbol л 


designates the number of subjects in f 
methods. J a subgroup and К the number о 
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In part ii of this table, the symbol 7; designates the sum of the test scores 
for the subjects who were taught by method j. For example, Т) designates 
the sum of the test scores for the subjects taught by method 1. The symbol 
С designates the grand total ofallobservations intheexperiment. Gismost 
readily obtained by summing the 77's; that is, С = 38 + 37 + 62 = 137. 
The symbol ХХ? designates the sum of the squares of the observations on 
the subjects taught by method j. For example, XXj = 3° + 5 + +++ + 
% = 224. 

In part iv, Т, designates the mean of the test scores for subjects taught by 
method j. For example, T, = 4.75 is the mean test score for the subjects 
under method 1. The symbol 7 designates the grand mean ofall test scores. 
When there is an equal number of subjects in each of the subgroups, G = 
(ЕТ,)/К, where k is the number of subgroups; this relationship does not 
hold if the number of subjects within each subgroup varies. 

The symbol SS, in part iii designates the sum of squares, or variation, of 
the test scores within the group under methodj. By definition, the variation 
of the observations within method / is 


(1) SS, = У(Х — T)*, 


i.e., the sum of the squared deviations of the test scores under method j 
about the mean of subgroup/. This definition is algebraically equivalent to 


(2) ss, ==x?-—, 


which is a more convenient computational formula. Use of this formula is 
illustrated in part iii. ThesymbolSS,, designates the pooled within-method 
variation (or sum of squares). By definition, SS, is the sum of the variation 
within each of the methods, 


9) SS, = ESS}: 

A computationally more convenient formula for SS,, is 
ЖТ; 

% 55, = zx -=A 


For the data in the table, (4) is 
(38)? + (37)? + (62) 
8 


55.0195 
— 919 — 6657 _ 919 — 832.12 = 86.88. 
8 


This value for SS,, is the same as that obtained in the table using (3). ім 
The variation (or sum of squares) due to the methods of training is by 


definition 
6) SSmethods гг nX(T; = Gy. 
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This statistic measures the extent to which the means for the subgroups differ 
from the grand mean. It is also a measure of the extent to which the sub- 
group means differ from one another. Interms of this latter interpretation, 
(5) may be shown to be algebraically equivalent to 
nX(T, — Ty 

k(k — 1) 
where the symbol 7, — T, designates the difference between a pair of means. 
For the data in the table, (6) is 


8[(4.75 - 4.62)? + (4.75 7.15)? + (4.62 — 7.75)"] 


(6) SSmethoas = 


SSmetnoas 


3(2) 
= 50.17. 
Using (5), 
SSmethoas = 8[(4.75 — 5.71)? + (4.62 — 5.71)? + (7.75 — 5.71)?] 
550717; 


Neither (5) nor (6) is a convenient computational formula for 55,100) the 
latter is given by 
2T? G 
(7) SSmethoas == – — B 
n nk 


For the data in the table, (7) is numerically 
(38)? + (37)? + (62)? (137)? 
8 24 
= 832.12 — 782.04 = 50.08. 
The numerical value for $$ а computed from (5) and (6) involves more 
rounding errors than does the computational formula (7). 


The variation due to experimental error is, by definition, the pooled 
within-method variation, 


(8) SSerror = SS, = У55,. 
This statistic measures the sum of the variation within each of the sub- 


groups. Its computational formula is given by (4). Тһе total variation, 
or total sum of squares, is 


(9) ЕЕ 6). 


the sum of the squared deviation of each observation in the experiment about 
the grand mean. Its computational formula is 


SSmetnods 


2 
(10) Stora = (ЭХ?) — 5. 
For the data іп the table, $ 
SSmi = 919 — (1377 
24 


— 136.96. 
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From these definitions of 55,1, SSmetnoass and SS,,,, it may be shown 
algebraically that 

SStotai = SSmetnoas ар SSerror, 
ХХХ,, Gy nX(T, G) 4 ХХХ; Ty. 
The relation (11) describes a partition, or division, of the total variation into 
two additive parts. One part is a function of differences between the mean 


Scores made by the subgroups having different methods of training; the other 
part is the sum of the variation of scores within subgroups. Numerically, 


136.96 = 50.08 + 86.88. 


The partition represented by (11) is basic to the analysis of variance. Its 
derivation is not difficult. Let 
а= X,—T, and, bj — T —G. 
Then а + b = Xi, — б, 
ad Уух, — 07 = Хуа, + by = УУФ + УУМ рум, 


(11) 


The term at the extreme right is 
XXab;— Xa) =0, 
ғ i 


Since Х,а, = Oforeachj. (Thatis, E,a;; is the sum of deviations about the 
mean of observations in class j.) Since d? is a constant for all 5 in the same 


class, 
УУР = пу}. 
із і 
Непсе IIX,- GP = dda; + "УМ 
YQ D + "Хт; — б)% 
ғ 


A variance, іп the terminology of analysis of variance, is more frequently 
called a mean square (abbreviated MS). Ву definition 


variation ЕЗ 55 Р 
degrees of freedom df 


In words, a mean square is the average variation per degree of freedom; 
this is also the basic definition for a variance. The term mean square 15 a 
More general term for the average of squared measures. Hence a variance 
is actually a special case of a mean square. А д 
Тһе (егіп degrees of freedom originates from the geometric representation 
of problems associated with the determination of sampling distributions for 
statistics, In this context the term refers to the dimension of the geometric 
Space appropriate in the solution of the problem. The following definition 


(12) Mean square = 
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permits the computation of the degrees of freedom for any source of varia- 
tion: 
no. of independent no. of independent 
(13) Degrees of freedom = | observations on | — | parameters estimated | 
source of variation in computing variation, 


In this context, a statistic may be used either as anestimate of a parameter or 
as a basic observation in estimating a source of variation. The source 
of variation being estimated will indicate what role a particular statistic will 
have in a specified context. More accurately, 


(13a) df = (no. independent observations) — (no. linear restraints). 


Substituting an X; fora u; in the computation ofa mean square is equivalent 
to imposing a linear restraint upon the estimation procedure. The sub- 
stitution restricts the sum of a set of observations to be a specified number. 
For example, if the mean of four scores is required to be 10, and if the first 
three observations are 

3, —5, and 20, 


then the fourth score must be 22; that is, the total must be 40. Under this 
restraint on the mean, only three of the four scores are free to vary. Hence 
the term freedom. 

In the computation of 58,4 the Ту are considered to be the basic 
observations; SSmemoas is a measure of the variation of the 7/s. Thus, there 
are k independent observations in SSinetnous» In the computation of this 
Source of variation, Gis used as a parameter estimating the mean of the 77's. 
Hence one estimate of a parameter is used inthe computation of 55,4% 
pun. by (13), the degrees of freedom for this source of variation are 


An alternative, computational definition of the degrees of freedom for a 
Source of variation is 


hors d no. independent points 
(13b) Degrees of freedom — (беа) — (ші which deviations |. 


are taken 
For example, SSmetnods = nX(T, — Gy 
taken about the single point G. Hence the degrees of freedom are k — 1. 
As another example, 58, ог = USS; involves n squared deviations for each 
of the k subgroups, or a total of kn Squared deviations. Within each sub- 
group the deviations are taken about the mean Т, of that subgroup. Since 
there are k subgroups, there are k different points about which the deviations 
are taken, Hence the degrees of freedom are nk — k for 56,2. 

‚ SSerror is the pooled variation within each of the subgroups. The varia- 
tion within the subgroup j is a measure of the extent to which each of the n 
observations deviates from the mean of the subgroup, 7, For this source 
of variation 7, is used as an estimate of the mean for population j. Hence 


involves k squared deviations all 
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the number of degrees of freedom for the variation within subgroup j is 
n—1, The degrees of freedom for the pooled within-subgroup variation 
is the sum of the degrees of freedom for each of the subgroups. Ifthe varia- 
tion within each of k subgroups has — 1 degrees of freedom, then the total 
degrees of freedom for k subgroups is k(n — 1), that is, the sum (n — 1) + 
(n — 1) + = + (л — 1) for k terms. 

SStota1 is the variation of the nk independent observations about the grand 
mean С. Неге С is an estimate of the over-all population mean. Hence 
SStoiai is based upon nk — 1 degrees of freedom. Corresponding to the 
partition of the total variation in (11), there is a partition of the total degrees 


of freed 
x EVE Ont dftotaı = dfmetnoas + dferror, 
(14) kn — 1 = (k — 1) + (kn — K). 


Table 3.2-2 Summary of Analysis of Variance 
EE ar a a es 


Source of variation Sum of squares | Degrees of freedom Mean square 


Between methods SSmethods = 50.08 k—1= 2 | MSmethoas = 25.04 
Experimental error SSerror = 86.88 kn —k = 21 MSerror = 4.14 
Total SStotaı = 136.96] kn —1 = 23 


A summary of the statistics used in the analysis of a single-factor experi- 
ment is given in Table 3.2-2. The numerical entries are those computed 
from the data in Table 3.2-1. À 

Assuming that there is no difference in the effectiveness of the methods 
of training, as measured by the mean scores on the test, and making addi- 
tional assumptions which will become explicitin the next section, the statistic 
MSnmetnods 

MSerror 

has a sampling distribution which is approximated by an F ela 
having k — 1 degrees of freedom for the numerator and kn — k degrees Я 
freedom for the denominator. Thus ће F statistic may be used a vA 
hypotheses about the equality of the population means for the methods. 


To test the hypothesis that the population means for the test scores ош, 
that is, ш, = u, = us, against a two-tailed alternative hypothesis, the 


decision rules are as follows: 
Reject Н, when Ғо > F1-4 
Otherwise do not reject Hy. 


For the data in Table 3.2-2, Fos = 25.04/4.14 = 6.05. Опе vod 
æ = 05 and « = .01 are, respectively, Figs(2,21) = 3.47 and it а 
5.78. In this case Fons exceeds the Critical value for « = .01. Hence 


(15) ре 


(k — 1, kn — k): 
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data do not support the hypothesis that the population means are equal. 
Inspection of the means in Table 3.2-1 indicates that method 3 has the 
largest mean. 

The experiment represented by the data in Table 3.2-1 is a special case of a 
single-factor experiment. For this case, the experimental variable is the 
method of training. In the general case the term treatment will be used 


Table 3.2-3 General Notation 


Treatment 1 Treatment j Treatment k 

Xu Xy Xu 

Xn Xoj Xor 
@ 

Xa X; Xir 

Ха Xni Xnr 
k Ti Es T, es T. G 
Gi) xxi zx; xxi xxx» 

1 j Т, G 
хаа G 
SStreat = WS MES Ше = k 1 

m УТ? 
(iii) SSerror = X(2X3) — = dferror = kn — k 


" G 
SStotar = (АЎ) — i ош = kn — 1 


——— À— —— ыы € Еа 


Sums of squares in terms of 


Computational symbols computational symbols 
(iv) (1) = G?|kn SS = (3) - (1) 
/ treat = 
Q) = xen) БЕ М2) =G) 
G) = (Етп SStotar = (2) - (1) 


—— À—— M MÀ à —ÓÀ 


interchangeably with the terms experimental variable, experimental con- 
dition, or whatever it is that distinguishes the manner in which the sub- 
groups are handled (treated) in the experiment. The elements assigned to à 
treatment constitute what will be called a treatment class. A general nota- 
tion for the observed data in a single-factor experiment having n observa- 
tions in each treatment class is given in part i of Table 3.2-3. For example, 
an observation on the element żin treatment class jis designated Y; Nota- 
tion for the totals required in the computation of the sums of squares appears 
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in part ii, For example, T; designates the sum of all observations in treat- 
ment class j, and > Х? designates the sum of the squares of the observations 
in treatment class j. Both sums are generally obtained in a single operation 
ona calculator. 

In part iii the computational formulas for the sums of squares used in the 
analysis of variance and the associated degrees of freedom аге given, Acon- 
venient method for summarizing the computational formulas is given in part 
(iv) in terms of what may be called computational symbols. For example, 
the symbol (2) designates the numerical value of X(XX;). 

The degrees of freedom for a sum of squares may be computed directly 
from the computational formula by means of the following rule: Count the 
number of quantities which are squared in a term. Then replace this term 
by this number in the computational formula. For example, in the term 
(“Т/п there are k quantities that are squared (Ti, 7,..., Т,). In the 


Table 3.2-4 General Form of Summary Data 


Source SS F 
f х URS MStreat 
Treatments SStreat ki MS treat ДУ MSerror 


Experimental error SSiror kn —k MSerror 


Total SStotat kn = 1 


term G?/kn there is just опе term which is squared, namely, G. Hence the 
degrees of freedom of the sum of squares defined by [(27})/n — G?|kn] are 
k — 1. As another example, in the term X(X X3) there are kn terms that are 
squared, namely, each of the kn individual observations. Hence the sum of 
squares defined by [£(ZX7) — G?/kn] has kn — 1 degrees of freedom. 

The general form used in summarizing the analysis of variance for a 
single-factor experiment is given in Table3.2-4. The F statistic 1s песа 
testing the hypothesis that ш = Js» — 7 = My against the equivalent о | 
two-tailed alternative hypothesis. If this hypothesis is rejected, additional 
tests are required for more detailed information about which means are 
different from theothers. Specialized tests for comparing individual means 
with each other are discussed in later sections. j 

The formal method for testing statistical hypotheses requires that the level 
of significance ofa test be set in advance of obtaining the data. Convention 
in the analysis of variance is somewhat opposed to this procedure. The 
value of Foy; is generally compared with tabled critical values, and the out- 
come is described in terms of the statement: Fons exceeds a specified per- 
centile point (usually the .95 or the .99 percentile points). The ШЕ ien 
level of significance is thus in part determined by the observed data. T his 
procedure is not objectionable for purposes of estimating the probability 
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of the observed outcome in terms of an assumed underlying sampling distri- 
bution. However, this procedure does not permit the evaluation of the 
power associated with the test. 


3.3 Structural Model for Single-factor Experiment—Model I 


Suppose all elements in a specified basic population are given treatment 
1. After the treatment, the elements are measured on a criterion related to 
the effectiveness of the treatment. Assume that the distribution of the 
resulting measurements is approximately normal in form, with parameters 


Table 3.3-1 Parameters of Populations Corresponding to Treatments 
in the Experiment 


Treatment Population mean Population | Treatment 
| variance | effect 
- Imp mq Й 
1 My о | =m- h 
2 Hg оў Ta = [lg — и 
. | : К 
|| 
і м 4 | --ы-» 
й | 
s 
J "y оў Tp = My — Н 
k a қ Pk | оў Th = My =H 
Grand mean = и | m 


у^ +т 4 A 4r (ШШ. + А.П SET ЖШ ШИЛ БИДИН 


ш апа oj. Thus, ш, and o? are the parameters of a population of measure- 
ments that would exist if treatment 1 were administered. This potential 
population will be designated as that corresponding to treatment 1. Sup- 
pose, instead, that treatment 2 is administered to all the elements and then 
measurements on the same criterion of effectiveness are made. Assume 
that the resulting distribution is approximately normal in form, with pa- 
rameters и, and o2. 

Corresponding to each treatment about which the experimenter seeks to 
make inferences, there is assumed to bea population of approximately nor- 
mally distributed criterion measures. The number of such populations is 
equal to the number of treatments. Assume that the number of treatments 
із К. Intheexperiment, data are obtained on k of the possible K treatments. 
If k = K, then observed data are available on all treatments in the domain to 
which inferences are to be made. If k is less than K, then observed data are 
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available on only some of the treatments about which inferences are to be 
drawn. In this section the case in which k = K will be considered. Other 
cases are considered in the next section. 

The parameters defining the treatment populations are summarized in 
Table 3.3-1. When k = K, the grand mean y is 
Хн. 

k 

The effect of treatment j, designated ту, is the difference between the mean 
for treatment j and the grand mean of the population means, 
(2) T, = Иа 
Thus т; is a parameter which measures the degree to which the mean for 
treatment / differs from the mean of all other relevant population means. 
Since the sum of the deviations about the mean of a set is zero, Er; — 0; 
hence the mean of the 7,’s, designated 7, is equal to zero. Н 

Let X,, be the criterion measure on a randomly selected element / in treat- 
ment population j. The following structural model is assumed for this 
measurement, 


(3) Xj =U 7; Eip 


where и = grand mean of treatment populations, 

т; = effect of treatment j, 

ғу = experimental error. ( 
The term уг is constant for all measurements in all treatment populations. 
The effect т; is constant for all measurements within population j; however, 
a different value, say, ry, is associated with population j^, where j’ represents 
Some treatment other than j. The experimental error ғ;; represents all the 
uncontrolled sources of variance affecting individual measurements; this 
effect is unique for each of the elements i in the basic population. This 


effect is further assumed to be independent of 7;. 3 ue 
Since both и and т, are constant for all measurements within population 


J, the only source of variance for these measurements is that due to experi- 
Mental error. Thus 

(4) È= d, 

Where o? designates the variance due to experimental error for the measure- 
ments within treatment population J: If Ху represents а measurement in 
population j’, then (3) takes the form 

(5) Xy =U + Ty + Eir 

The variance within population /” is due solely to the experimental error; 
hence 

(6) оў = Oy 


@) в 
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If the elements (subjects, animals) that are observed under each of the 
treatment conditions are assigned at random to these conditions, one has 
some degree of assurance that the error effects will be independent of the 
treatment effects. Hence the importance of randomization in design 
problems. The elements, in this context, are called the experimental units. 
Random assignmentof the experimental units to the experimental conditions 
tends to make the unique effects of the units per se independent of the treat- 
ment effects. 

The experimental error measures all uncontrolled effects which are not 
related to the treatments. As such, the experimental error is the combined 
effect of many random variables that are independent of the treatment 
effect. Under these conditions, it is reasonable to assume that the distribu- 
tion of the ¢,;’s within population j will be approximately normal in form, 
with expected value ш, = 0 and variance оё. If the sources of experi- 


mental error are comparable in each of the treatment populations, it is also 
reasonable to assume that 


(7) ip саг 


This last relationship may be written in more general form as 


(8) e = а = +++ = а? = д, 


where c? is the variance due to experimental error within any of the treatment 
populations. 

To summarize the assumptions underlying the structural model (3), a 
measurement X; is expressed as the sum of three components: (1) a com- 
ponent и which is constant for all treatments and all elements; (2) a com- 
ponent т; which is constant for all elements within a treatment population 
but may differ for different treatment populations; (3) a component ғ; 
independent of 7;, and distributed as N(0,62) within each treatment popula- 
tion. This structural model is called the fixed-constants model, or model I. 
The component и in model I is actually an unknown constant; the com- 
ponent т, is a systematic or fixed component which depends upon the 
difference between the means of the treatment populations; the component 
21 18 а random component (or a random variable) depending upon uncon- 
trolled sources of variances assumed to be drawn randomly from a popula- 
tion in which the distribution is N(0,02). 

A parameter indicating the extent to which the treatment effects differ is 


(9) dul hy ay 
k—1 k—1 ° 


An equivalent definition in terms of differences between treatment effects is 


(10) o? X, = Ty)" У(и; — n, 
Ss) KE t 
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where the summation is over all the different possible pairs of means; there 
are k(k — 1)/2 such distinct pairs. When the treatment effects are equal, 
ie, when ті = 7; = > = ту, o? will be zero. The larger the differences 
between the 7's the larger will be o?. Thus the hypothesis specifying that 
o? = 0 is equivalent to the hypothesis that specifies т, = ту = = т, or 
Ba a =ч=н ДЕ 

Statistics useful in estimating the parameters in Table 3.3-1 are sum- 
marized in Table 3.3-2. Since model I assumes that the population 


Table 3.3-2 Estimates of Parameters of Treatment Populations 


Sample Sample 
ES | Treatment Sample mean VAS Treatment effect 
n 1 T st LT 
n 2 T, we tf =1,-G 
n j T; sj 1 = T; -6 
п ТІ 7; 9 ty =Ty 6 
n k Т, 5 ty = T, —6 
Grand mean — G і-0 


_——_—————_——————— 
variances аге all equal to о?, the best estimate of this parameter is the 
pooled within-class sample variance 

Xsj 
(11) Spooled тж е = MSerror- 


For this design the pooled within-class variance is designated MSerror. 
The latter is an unbiased estimate of o7; that is, 


E(MSerror) = 02. 


In terms of the structural model in (3), the treatment means for the samples 
of n observations may be expressed as 


(12) 
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where &; is the mean experimental error for a sample of n observations 
within treatment class /. For random samples, it is reasonable to assume 
that E(é;) = constant. Without any loss in generality, this constant may 
be assumed to be zero. Therefore, 


ET) = тт, 
(13) т As cn 
ETI sce 


The first relation in (13) may be interpreted as follows: If an infinite number 
of random samples of size n are given treatment | and the statistic Т, is 
computed for each of the samples, the distribution of the resulting statistic 
would have an expected value (or mean) equal to u + ту. 

The statistic f; іп Table 3.3-2 may be represented as 


(14) „= T, — G= (u T£) — (и +8) 


=7,+ 6; — ё. 


where 2 is the mean of the é,’s. The expected value of t; is equal to ту, since 
the expected value of the quantity ғ, — £ equals 0. 

A measure of the degree to which the sample means for the various treat- 
ments differ is provided by the statistic 


nXT,— Gy _ пуй 


(15) MStreat = aay Ex 


Inspection of the right-hand side of the relation (14) indicates that differences 
among the 775 depend upon differences among the 7,’s and differences 
amongthez/s. Since the т/$ and the ғ/ are assumed to be independent, 
the variance of the 7"s is the sum of the variance of the 7’s and the variance 
of the é’s. Therefore n times the variance of the 77 has the expected value 


2 z2 
(16) р Eur pe 
k—1 k—1 

= no? + поё. 


Since the variance of the mean of n observations is n times the variance of the 


individual observations, using the relationship that o? = no? permits (16) 
to be written as 


(17) E(MStreat) = no? + o. 


Thus, when ту = т 2-77: = ту, the term o? is equal to zero and the expected 
value of MStreat is о. This implies that MSu is an unbiased estimate of 


the variance due to experimental error when there are no differences among 
the treatment effects, 
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A more general approach to the result obtained in (17) is to start from the 
relation 
(18) VAI ae Monica) ae (ғ, — ғ,). 
This relation is obtained from (12). In this form, a difference between two 
treatment means estimates a difference between two treatment effects and a 
difference between two average error effects. Further, T, — T, provides 
an unbiased estimate of т; — ty. MStreat is readily expressed in terms of all 
possible pairs of differences among the 775. Similarly o? and о? may be 
expressed in terms of the differences on the right-hand side of (18). In the 
derivation of the expected values for the mean square due to treatments, the 
more direct approach is by means of relations analogous to (18). 


Table 3.3-3 Expected Values of Mean Squares 
(n Observations per Treatment) 


Source df MS E(MS) 


Treatments k-1 MStreat в? + no? 
Experimental error | km —k | MSerror o 


The expected values of the mean squares which are used in the analysis of 
variance are summarized in Table 3.3-3. In terms of these expected values, 
since the sampling distributions of MStreat and MSerror are independent, 


(Menos) ire о? ki: no; 
MSres o 

When o? = 0, the expected values of the numerator and denominator of this 
Tatio are both equal to o?. Hence numerator and denominator are inde- 
pendent, unbiased estimates of o? when o? = 0. The independence of the 
estimates follows from the fact that the sampling distribution of the within- 
class variance is independent of the class means. MSerror is obtained from 
the within-class data, whereas Мүге is obtained from the class means. 

Thus, when о? = 0, the sampling distribution of the statistic 


MSireat 
4 i MSerror 
is an F distribution having k — 1 degrees of freedom for the numerator and 
kn —1 degrees of freedom for the denominator. This F statistic may be 
used to test the hypothesis that оё = 0, which is equivalent to the hypothesis 
Шат, = т, = -+-+ = т. The assumptions under which this test is valid are 
those underlying model I; these assumptions were discussed in connection 
with the structural model. The magnitude of type | error is not seriously 
affected if the distributions depart moderately from normality or if the pop- 
ulation variances depart moderately from equality; i.e., the test is robust 
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with respect to the assumptions of normality of distribution and homo- 
geneity of error variance. А study of the effect of lack of homogeneity of 
error variance is given in the work of Box (1954). 

When c? + 0, the expected value of the F statistic will be greater than 1.00 
by an amount which depends upon the magnitude of o?. Thus, if the F 
ratio is larger than 1.00 by an amount having low probability when o? = 0, 
the inference is that c? 52 0. 

The notation ave(MS), read average value of a mean square, is used by 
some authors in place ofthe notation E(MS). Also the notation о? is some- 
times reserved for use in connection with model II, and no special symbol is 
used to designate the quantity n(£7?)/(k — 1). Thus, for model I 

2 
E(MStreat) = о? кір = . 


As long as the assumptions underlying the model are made explicit, no 
ambiguity will result in the use of the symbol o? in model I. 


3.4 Structural Model for Single-factor Experiment—Model II 
(Variance Component Model) 


One of the basic assumptions underlying model I is that all the treatments 
about which inferences are to be made are included in the experiment. 
Thus, if the experiment were to be replicated, the same set of treatments 
would be included in each of the replications. Model I is usually the most 
appropriate for a single-factor experiment. If, however, the k treatments 
that are included ina given experiment constitute a random sample from a 
collection of K treatments, where k is small relative to K, then upon replica- 
tion a different random sample of k treatments will be included in the 
experiment. 


Model II covers this latter case. Тһе structural equation for model IL 
has the same form as that of model I, namely, 


(1) Xi =U FT; + ву. 


However, the assumptions underlying this model are different. The term и 
is still assumed to be constant for all observations; the term e, is still 
assumed to have the distribution N(0,02) for all treatments; but the term 7; 
is now considered to be a random variable. Whether or not a given treat- 
ment is included in an experiment depends upon a random sampling pro- 
cedure. Within the sample of n observations in treatment class DEAS 
considered to be a constant. The distribution of т; is assumed to be 
N(0,02). 

The estimates ofthe parameters in (1) have the same form and are numeri- 
cally equal to estimates made from data which assume modell. From the 
point of view of the computation of estimates of the various effects, the two 
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models do not lead to different procedures. For samples of size n, the 
expected values of the mean squares of the estimators are as follows: 


Source | af | MS | E(MS) 
| ә 


Treatments i MStreat | 02 + »( - ios 


Experimental error | kn —k MSerror | в? 
If k is small relative to К, then the coefficient 1 — (К/К) becomes very close 
to 1,00 and the expected value of the mean square for treatments becomes 
a; + по?. This latter value was the one that was obtained under model I. 
The test on the hypothesis o? = 0 is identical with that made under model I. 
In more complex designs, the F ratios for analogous tests do not have iden- 
tical forms. 


3.5 Methods for Deriving Estimates and Their Expected Values 

Under model I, the structural equation for an observation has the form 
(1) Ху =M 7; Ғе» 
where y is a constant for all observations, т; is a constant (or fixed variable) 
for observations under treatment j, and ғ;; is a random variable independ- 
ent of 7; having the distribution N(0,o7) for all treatments. Thus, there are 
k fixed variables and one random variable in the model. This model is 
similar to that underlying a k-variate multiple regression equation; hence 
the methods for the solution of estimation problems in multiple regression 
are applicable. The method of least squares, which is used in the solution of 
multiple regression problems, will yield unbiased minimum-variance esti- 
mators when applied to estimation problems posed by model n (The 
usual multiple regression problem involves correlated predictors. ү In this 
case, the predictors are uncorrelated. The latter restriction simplifies the 
computational procedures, but the underlying principles are identical.) 

The generalization of model I forms a major part of what is known 
as classical analysis of variance; estimation problems in this area may be 
handled by means of least-squares analysis or by means of maximum- 
likelihood methods. The two approaches lead to identical results for this 
model. 

To illustrate the least-squares approach to (1), let m, ty, %,. 
€; designate the respective least-squares estimators of Hy Tis Tos + + 
ғ, The method of least squares minimizes the quantity 


(2) Ere, = УУ(Х — т — 1)? = minimum. 
i£ 9 


..„ fj, and 
+) Te, and 


By taking partial derivatives with respect to the estimators m, thy tay ees fi 
(this is the procedure used to obtain the normal equations in the multiple 
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regression problem) and setting these derivatives equal to zero, one obtains 
the equations 
knm + nXt; — G = 0, 
nm + nt, — T, = 0, 
(3) nm + nt, — T, = 0, 


nm + nt, — T, = 0. 


There are k + 1 equations and k + 1 unknowns; however, the k + 1 equa- 
tions are not independent, since the first equation may be obtained by sum- 
ming the last К equations. A unique solution to these equations requires a 
restriction on the variables. Since Ут, = 0, a reasonable restriction is that 
Уг, = 0. The solutions to (3) now take the form 


G 
LUC 
m is 
h T, — nG 7, — 6, 
(4) n 
cob CAR д 
n 
числ Sip 
n 


Each of theseestimatorsis an unbiased estimate of the corresponding param- 
eterin(1). Thus, 


E(m) = E(G) = и; E(f) = Е(Т, — б) = т; E(t) = E(T, — б) =» 


Although the expected value of г, is the parameter т, in terms of the 
parameters of the model, t; has the form 


(5) h= =G (, ЗЕ ұза) (u " пУт; = 
п 


kn kn 
ET) 8 =, 
where £, = (X,e,)|n and ғ = (EXej)kn. Thus, 
(6) MStreat = пу; пУ(т, + ғ; i= 8r 
k—1 k—1 


Since the z's and the e’s are independent, the expected value of MSireat will 
be n times the sum of the mean squares of the parameters on the right-hand 
side of (6), that is, n(o? + 02) = no? + o2, This is a general property of 
least-squares estimators: the expected value of the mean square of an esti- 
mator is the sum of the respective mean squares of the parameters estimated. 
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The term 
(7) е = Xj—m—t,2X4—6—(T,—G) 
= Xu — T= +7; + eu) — (B 7; +E) 
= у — &. 
Thus 
ZX(X,— Ty 4 
8 Е 2) 3l ij E T 
(8) (ег;) КЕЛ ZA EW: 2 


Model II, which has more than one random variable, does not permit the 
direct application of least-squares methods. Instead, maximum-likelihood 
methods are used to obtain estimators and the expected values of the mean 
squares of these estimators. General methods for obtaining the expected 
values for mean squares under model II are discussed by Crump (1951). 

In experimental designs which are discussed in later chapters, a mixture of 
model I and model П often occurs. The expected values of the mean squares 
in designs having a mixed model depend upon the randomization restrictions 
which are imposed upon the variables. When different sets of restrictions 
are imposed, different expected values for the mean squares are obtained. 


3.6 Comparisons among Treatment Means 


A comparison or contrast between two treatment means is by definition 
the difference between the two means, disregarding the algebraicsign. Thus, 
T, — T, defines a comparison between these two means. Comparisons 
among three treatment means can be made in several different ways. Each 
Mean may be compared or contrasted with each of the other means, giving 
rise to the following differences: 


T, T, T, T, T, Ts. 
Another set of comparisons may be obtained by averaging two of the three 


means and then comparing this average with the third mean. This pro- 
cedure gives rise to the following differences: 


T, 4- T, Т. Т, + Т т, TET т 
2 2) 2 2 


In general, a comparison or contrast among three means is an expression of 
the form 


а) ef, + Т + Тә, where Zc, = 0. 


[Any expression having the form 
wT, + WaTa + м, 
lled a linear function of the T’s. 


where there is no restriction on the w’s, is ca [ t | 
lized linear function of the 7"s.] 


A comparison, as defined in (1), is a specia 
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For example, if c, = 1, сь 1, and с; = 0, (1) becomes 
ІТ, + (C1)7, + (0)7, = 7, — Th. 
If c, = 0,c, = 1, and c, 1, then (1) becomes 
T, — Т, 
Ifc; = $, cg = —1, and c, = +, then (1) becomes 
(Df + Cf, + qr, 2 т, 


The general definition of a comparison or contrast among k means has 

the form 
(2) CT, + oT, be, where Ec, = 0. 
The expression (2) is called a linear combination or weighted sum of the 
means. If the sum of the weights in this expression is equal to zero, then the 
linear combination is a comparison or contrast among the means. When 
the number of observations in each treatment class is equal to n, it is more 
convenient to work with totals rather than means. A comparison along 
the treatment totals is defined by 


(3) CT, + 6T; + °° + eT, where Жс, = 0. 
A comparison between the first and second treatments is obtained by setting 
сі = 1,с = —1, and setting all other c's equal to zero. 


Orthogonal comparisons will be defined by means of a numerical example. 
Consider the following comparisons: 
G=(-3)h+(-)R+ DR + (307, 
с, (OT, + (CDT + (CDT + (Ty, 
бұ- (01+ (07,--(-1)Т,--(-1)Т,. 


For comparisons С, and С,, the sum of (һе products of corresponding 
coefficients is 


(-3Х1) + (—1)(—1) + @)(—1) + Y) = 0. 
For comparisons С, and Су, the sum of the products of corresponding 
coefficients is 

(—3)() + (11) + (1(—1) + (1)(—1) 6. 
Two comparisons are orthogonal if the sum of the products of corre- 
sponding coefficients is equal to zero. Thus, the comparisons C, and C, are 
orthogonal, but the comparisons С, and C, are not orthogonal. To find 


out whether or not C, is orthogonal to C;, one forms the sum of products 
of corresponding coefficients, 


x ; (XX) + CDW + (—1)—1) 4- (001) = 0. 
Since this sum is equal to zero, the two comparisons are orthogonal. The 


concept of orthogonality in this context is analogous to the concept of 
nonoverlapping or uncorrelated sources of variation. 
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A component of the sum of squares for treatments is defined by the expres- 
sion 
(cT; + E + eT)” 
n(d + ce + +++ сд) 
(It is assumed that there are л observations in each treatment total.) In 
words, a component of a sum of squares is the square of a comparison 
divided by n times the sum of the squares of the coefficients in the comparison. 
A component of a sum of squares has one degree of freedom because it 
basically represents the squared difference between two observations, each 
observation being the weighted average of treatment means. A sum of 
squares based upon two basic observations has one degree of freedom. 
Two components of the sum of squares for treatments are orthogonal if 
the comparisons in these components are orthogonal. Any treatment sum 
of squares having k — 1 degrees of freedom can be divided into k — I 
orthogonal components; there are usually many ways in which this can be 
done. A treatment sum of squares can also be subdivided into a relatively 
large number of nonorthogonal components. Orthogonal components 
are additive; i.e., the sum of the parts equals the whole. Each component 
in an orthogonal set covers a different portion of the total variation, Non- 
orthogonal components are not additive in this sense. 
The computation and interpretation of the components of the sums of 
squares for treatments will be illustrated by means of the numerical example 
used in Table 3.2-1. For these data, 


n=8, К=з ПЕШ Ty oie ly 02) 
SSmethoas = 50.08, MS error = 4.14. 
Consider the following comparisons: 
С, = (1)7, + (-)h + OR = Т, — Te = 38 — 37 = 1, 
Gr (DT + (0+ (52) 9 1-4 1, 21, = 38 + 37 — ый 
С, represents a comparison between methods 1 and 2, method 3 being dis- 
regarded. This comparison is used in testing the hypothesis that и — H 
= 0. С, represents a comparison between method 3 and the average of 
methods 1 and 2. This comparison is used in testing the hypothesis that 
(а + 1/2] — из = 0. С, and C; are orthogonal, since the sum of the 
products of corresponding coefficients is zero; that is, 
(11) + C700) +(0(—2) = 0. 
The component of the sum of squares corresponding to C; is 
sS, = знн = ш 
быч ОСОО 80 

s follows: Of the total variation among the 
= 50.08, that part which is due to the 


2 where Хс = 0. 


This component is interpreted a 
methods of training, SSmethods 
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difference between methods 1 and 2 is SSc, = .0625. The component 
corresponding to C, is 

(,+1,—21*_ (-Фу 
Si? + 0) + (—2)#] 8(6) 
Thus, of the total variation between the methods, that part which is due to 
the difference between method 3 and the other two methods combined is 
SSc, = 50.02. Method 3 appears to be clearly different from the other two 
methods; there is very little difference between methods | and 2. 

Since a component of a sum of squares has one degree of freedom, there 
is no difference between a sum of squares or a mean square for a component. 
The expected value of the sum of squares for C, is 


5 = 50.02. 


E(SSo,) = o? + 5 — n. 


Under the hypothesis that т, — т, = 0, E(SSc,) = o}. Hence the ratio 
E(SS,..) а? 

LO. =— = 1.00, һ = Tg = 0. 

E(MSerror) оз isis tell 
Thus the statistic 

_ 55с 
' MSerror 

may be used in a test of the hypothesis that z, — т, = 0. This F ratio has 
one degree of freedom for the numerator and kn — k degrees of freedom for 
the denominator. For these data Е (1,21) = 4.32. The numerical value 
of Fons is 


This F ratio is less than 1.00 and hence does not exceed the critical value 
4.32. Thus, the observed data do not contradict the hypothesis that 


Ti тта = 0, 
The hypothesis that (т, + 7,)/2] — =, = O is tested by use of the statistic 
55, 50.02 


= —— = 42.08. 
MSerror 4.14 


The critical value is again Fos(1,21) = 4.32. Thus, the observed data 


contradict the hypothesis that method 3 is no different from the average 
effect of methods | and 2. 


The comparison used in a test of the hypothesis that z, — т, = 0 is 
D- Tg = 38 — 62 = —24. 
The component corresponding to this comparison is 
(T, — Т)? xe 241 
n[Qy* + (—1)°] 8(2) 


F= 


= 36.00. 
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The F ratio for this test is 
F= 36.00 
4.14 
The critical value for a .05-level test is F,,(1,21) = 4.32. Since the ob- 
served value of F exceeds 4.32, the hypothesis that 7, — 7, = 0 is rejected. 
The sum of squares for methods, SS,,etnoas = 50.08, has two degrees of 
freedom. Since $$, and SS,, are orthogonal, they represent nonover- 

lapping (additive) parts of SS, Hence 
SSmethoas = SSc, Gij 580. 


50.08 = .06 + 50.02. 
С, and С, are not the only pair of orthogonal comparisons that may be 
formed. For example, 
C= (DT + OR + (01 
C, = (—1)Т, + @)7» + (—1)7% 
аге also orthogonal. The components corresponding to these comparisons 
will also sum to SS,yetnoas: In practice the comparisons that are constructed 
are those having some meaning in terms of the experimental variables; 
whether these comparisons are orthogonal or not makes little or no dif- 
ference. 
The general form of the hypothesis tested by a comparison is 
Hj. Хош 0 20,- 0 
Hy: Хсуш = 0, 
where the c,’s are the coefficients in the comparison. The statistic used in 
making the test has the general form 
Pei: 
© (nSc2)(MSerror) 
The numerator of this F ratio has one degree of freedom. The degrees of 
freedom for the denominator are those of MSerror. | nu 
When a large number of comparisons is made following a significant 
over-all F, some of the decisions which reject H, may be due to type 1 error. 
For example, if 5 independent tests are each made at the 05 level, үл proba- 
bility of a type 1 error inone or moreof the five decisions 151 — (95) = 23. 
If 10 independent tests are made, the probability of a type 1 error's 
occurring in one or more of the decisions is 1 — (959 = .40. In general, 
ifa = .05 and a series of m independent tests are made, then the probability 
of one or more type | errors in the series of m tests 15 obtained by subtracting 
the last term of the expansion of (.05 + .95)" from unity. Equivalently, 
one may add all terms in the expansion except the last one. " 
Thus, when the number of comparisons is large, the number of decisions 
that can potentially be wrong owing to type 1 error can be relatively large. 


= 8.70. 


Ш 
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Tukey and Scheffé have developed methods for constructing simultaneous 
confidence intervals which avoid the pitfall of permitting the type | error 
to become excessively large. These methods as well as others for dealing 
with multiple comparisons are discussed in Secs. 3.8 and 3.9. 


3.7 Use of Orthogonal Components in Tests for Trend 


The total variation of the treatments may be subdivided in many different 
ways. Depending upon the nature of the experimental variables and the 
purpose of the experiment, some subdivisions may be meaningful and others 
may not. The meaningful comparisons need not be the orthogonal ones. 
If the treatments form a series of equal steps along an ordered scale, i.e., 
increasing dosage, intensity, complexity, time, etc., then treatment variation 
may be subdivided into linear, quadratic, cubic, etc., trend components 
through the use of orthogonal polynomials. This method of subdivision 
provides information about the form of the relationship between the 
criterion and the steps along the treatment scale. 

Many different curves may be found to fit a given set of empirical data. 
The use of polynomials of varying degree may or may not be the most 
appropriate choice for the form of the curve. From one point of view this 
choice simplifies certain of the statistical problems. From another point 
of view, the use of higher-degree polynomials may be scientifically meaning- 
less, Polynomials are, however, quite useful in showing the general form of 
relationships. Within a limited range of values, polynomials can be used to 
approximate curves which are actually exponential or logarithmic in form. 

Caution is required in the use of the methods to be described in this 
section. The nature of the data and the purpose of the experiment must be 
considered in evaluation of the results, 

In regression analysis, a first-degree (linear) equation is used as the first 
approximation to the relationship between two variables, Second-degree 
(quadratic), third-degree (cubic), and higher-degree equations are used if 
the fit of the linear relationship does not prove to be satisfactory. When 
there are equal numbers of observations in each of the treatment classes, 
and when the treatment classes form equal steps along an ordered scale, the 
work of finding the degree of the best fitting curve is simplified by use of 
comparisons corresponding to these curves. Once the degree of the best 
fitting polynomial is found, the Tegression coefficients for this curve are also 
readily obtained. Extensive tables exist giving the coefficients for the com- 
parisons corresponding to regression equations of varying degree. The 
most complete set of coefficients is in the work of Anderson and Houseman 
(1942). The Fisher and Yates tables (1953) also have an adequate set. A 
numerical example will be used to illustrate the application of individual 
components of variation to the problem of finding the degree of the best- 
fitting curve. (Tests for the best-fitting curve are also called tests for trend.) 

Suppose that an experimenter is interested in approximating the form of 
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the relationship between the degree of complexity of a visual display (i.e., 
an instrument panel containing a series of dials) and the reaction time of 
subjects in responding to the displacement of one or more of the dials. 
After deciding upon a definition of the complexity scale, the experimenter 
constructs six displays representing six equal steps along this scale. The 
treatments in this experiment correspond to the degree of complexity of the 
displays. Random samples of 10 subjects are assigned to each of the dis- 
plays; each sample is observed under only one of the displays. The cri- 
terion used in the analysis is the mean reaction time (or some transformation 
thereof) for each subject to a series of trials which are as comparable as the 
displays permit. 
Table 3.7-1 Numerical Example 
nr талан к= кы 


Complexity TE NES 
of " 1 2 3 а 5 6 | =6; n= 
i Total 
display „24 m We. 2 |. ГЕ 
Т, 100 110 120 180 190 210 | 910 =G_ 
Q хх? 1180 1210 1600 3500 3810 4610 15,910 = Xxx? 
Т, 100 11.0 120 180 190 210 
(1) = G?/kn = (910)2/60 = 13,801.67 
(2) = xxx? = 15,910 
(3) = (ET3)n = (100° + 110° + --- + 2102)/10 = 14,910.00 


(ii) 
SSaisptays = (3) — (1) = 1108.33 
SSerror = (2) - (3) = 1000.00 


SStotar = (2) — (1) = 2108.33 


Summary data for this experiment are presented in Table 3.7-1. Each 
the 10 criterion scores for scale /. 


Т, in this table represents the sum over 3 
The sum of the squares of the 10 criterion scores under each scale is also 
given in parti ofthis table. Stepsin the computation of the sums of squares 


are given in part ii. 


Table 3.7-2 Summary of Analysis of Variance 


Source of variation | SS df | М5 Е 
11;975* 


221.67 
18.52 


Displays 1108.33 
Experimental error 1000.00 


Total 2108.33 


жж F 94(5,54) = 3.38 


appears in Table 3.7-2. Since there are six 


dom for the displays are 6 — 1 = 5. Ten sub- 
he sum of squares within a 


„Тһе analysis of varianc 
displays, the degrees of free 
jects were used under each display. Hence t 
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single display has degrees of freedom equal to 9; the pooled within-display 
variation has 6 x 9 = 54 degrees of freedom. The pooled within-display 
variation defines the experimental error. To test the hypothesis that the 
mean reaction times (or the means of the transformed measures) for the 
displays are equal, 
T MSaispiays 2% 221.67 91:972 
MSerror 18.52 


The critical value for this test, using the .01 level of significance, is 
Е (5,54) = 3.38. 


Clearly the data indicate that reaction times for the displays differ. Inspec- 
tion of the T/'s in Table 3.7-1 shows that the greater the degree of complexity 
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Figure 3.1 


the slower the reaction time. The means corresponding to the 7/8 are 
plotted in Fig. 3.1. This figure shows a strong linear relationship between 
display complexity and reaction time. There is also the suggestion that an 
S-shaped curve might be the best-fitting curve; an S-shaped curve corre- 
sponds to an equation having degree 3 (cubic). 

In the upper left part of Table 3.7-3 are the coefficients for linear, quad- 
ratic, and cubic comparisons for k = 6. These coefficients were obtained 
from Table B.10. Notice that the sum of the linear coefficients is zero, 
that each coefficient differs by 2 units from its neighbor, and that only once 
in the sequence from —5 to +5 does the sign change. For the coefficients 
defining the quadratic comparison, the signs change twice in the sequence 
from +5 to +5; in the cubic comparison there are three changes of sign in 
the sequence from —5 to +5. The number of times the signs change 
determines the degree of the polynomial 
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The numerical value of the linear comparison for these data is 
Cun = (—5)(100) + (—3)(110) + (—1)(120) 
+(1)(180) + (3)(190) + (5)(210) 


= 850. 
Since D = n¥c*, the component of variation corresponding to this compari- 
son has the form 


Clin 850? 
рип 10[(—5)? + (—3)? + (—1)? + 7 + 3* + 5°] 
2 
BEE as ТУН 
10(70) 


Thus the variation due to the linear trend in the data is 1032.14 units. From 
Table 3.7-2 it is seen that the total variation in the displays is 1108.33 units. 


Table 3.7-3 Tests for Trend 


а) (2) (3) (4) (9) © 
Т): 100 110 120 180 190 210 Ec С р-птх% CD 


" Linear =5) ЗА DIS 70 850 700 1032.14 
(i) Quadratic $ ЕТЕ 50 840 29.76 
Cubic —§ 17 47-41-71 5 180 -250 1800 34,72 
1096.62 
1 4 Tuus ass 
Test for linear trend: =—185 55.73 
5 29.76 
(іі i : ааа EL E 
) Test for quadratic trend: F 18.52 1.61 
34.72 
i : RECEN 
Test for cubic trend: F 18.52 87 


_____ wn D 
ж“ F 001,54) = 7.14 


action time 


Hence 1032.14/1108.33 = 93 per cent of the variation in the re 
tion. The 


for the displays may be predicted from a linear regression equa 

test for linear trend is given by 

. linear component _ 103214 55,73. 

MSerror 18.52 

The critical value for a .01-level test is Ёо9(1./) 

freedom for MSerror. In this case f = 54, and 

Clearly the linear trend is statistically significant. 
1108.33 — 1032.14 = 76.19 

ear regression equation. There 

bic trend might be statistically 


where fis the degrees of 
the critical value is 7.14. 
However, 


units of variation are not predicted by the lin 
is still the possibility that the quadratic or cu 
Significant. 
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An over-all measure of deviations from linearity is given by 
SSnoniin = SSaispiays — SSiin = 1108.33 — 1032.14 = 76.19. 


The degrees of freedom for this source of variation аге k — 2. Ап over-all 
test for nonlinearity is given by 
SSnontin/(k — 2) 76.19/4 19.05 
MSerror 18.52 18.52 

For a .01-level test, the critical value is Ғ.;(4,54) = 3.68. Since the F ratio 
does not exceed this critical value, the data do not indicate that there are 
significant deviations from linearity. However, when the degrees of 
freedom for MS,,,;, are large, the over-all test may mask a significant 
higher-order component. In spite of the fact that the over-all test indicates 
no significant deviations from linearity, if there is a priori evidence that a 
higher-order component might be meaningful, tests may continue beyond 
the linear component. Caution is required in the interpretation of signifi- 
cant higher-order trend components when the over-all test for nonlinearity 
indicates no significant nonlinear trend. 

The quadratic comparison is orthogonal to the linear comparison, since 
the sum of the products of corresponding coefficients is zero, i.e., 


(—5)(5) + (—3)(—1) + (—1)(—4) + @)(—4) + (3)(—1) + (5)(5) = 0. 
Therefore the quadratic component of variation is part of the 76.19 units of 
variation which is not predicted by the linear component. The computa- 
tion of this component is similar to the computation of the linear component, 
the quadratic coefficients replacing the linear coefficients in the calculation 
of Cand D. The numerical value of the quadratic component is 29.76. 
This is the increase in predictability that would accrue for the sample data by 
using a second-degree instead of a first-degree equation. А test on whether 


or not this increase in predictability is significantly greater than zero uses 
the statistic 


F 1.03. 


F= quadratic component 29.76 

MSerror 18.52 

The critical value for a .01-level test is Е (1, f); in this case the critical 

value is 7.14. The F statistic for quadratic trend is 1.61. Hence the 

increase in predictability due to the quadratic component is not significantly 
different from zero. 

_ It is readily verified that the cubic comparison is orthogonal to both the 
linear and quadratic components. Thus, the cubic component is part of the 
1108.33 — 1032.14 — 29.76 = 46.43 units of variation not predicted by 
the linear or quadratic trends. Тһе component of variation corresponding 
to the cubic trend is 34.72. The F statistic for this component is 1.87; the 
critical value for a .01-level test is 7.14. Thus, the sample data indicate that 


the cubic component does not increase the predictability by an amount 
which is significantly different from zero. 


1.61. 
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Of the total of 1108.33 units of variation due to differences in complexity 
of the displays, 1096.62 units are predictable from the linear, quadratic, and 
cubic components. The remaining variation is due to higher-order com- 
ponents, none of which would be significantly different from zero. In 
summary, the linear trend appears to be the only trend that is significantly 
greater than zero. Hence a first-degree equation (linear equation) is the 
form of the best-fitting curve. 

In this case the linear equation will have the form 


X = bK +a, 


where Х = predicted reaction time, 
K = degree of complexity of display, 
b, a = regression coefficients. 

Since the degree of complexity of the displays is assumed to be equally 

spaced along a complexity dimension, the degree of complexity may con- 

veniently be indicated by the integers 1, 2,...,6. The sum of squares for 

the complexity variable is given by 

n(k? — К) 

SSpc ay 


where k is the number of displays and n is the number of observations under 
each display. The entry 12 is constant for all values of n and k, The 
regression coefficient b is given by the relation 


D NT. component | NET . 
SS. 55к 


Equivalently, på Ay Cuin 
Diin 


where 4, is a constant which depends upon k; tables of coefficients for the 
orthogonal polynomials will give values of 2,. | 
The symbol 55,, is generally used in regressi 
variation predictable from the linear equation. 
regression coefficient a is given by the relation 


a= X — bK. 


on analysis to indicate the 
The numerical value of the 


Computation of the coefficients for the regression equation is summarized 


in Table 3.7-4. 
The linear correlation between degree of comp’ 


given by 
Т [SSu [103241 _ 79, 
TEN SStotal 2108.33 


lexity and reaction time is 
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The numerical value for SStoia is obtained from Table 3.7-2. The actual 
fit of the regression equation to the points in Fig. 3.1 is shown in Fig. 3.2. 


Table 3.7-4 Computation of Regression Equation 


X = reaction time K = display complexity 
X = G[kn = 15.17 K = (k + 1)/2 = 3.50 
_ п —k) _ (10216 — 6) _ 
55к = Pa ea T RES 175 


БЕСТЕН ПУЛ 
b= ss. = | pag = 2.48 


a = X — bK = 15.17 — (2.43)(3.50) = 6.67 
Regression equation: X = bK + a = 2.43K + 6.67 


The correlation associated with the cubic relationship is - 


es /SSiin + SSquaa + SScupic 
SStotai 
/1032.14 + 29.76 + 34.72 72 
2108.33 Uv 


The tests that have been made indicate that the cubic correlation does not 
differ significantly from the linear correlation. 
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Figure 3.2 


In working with higher-order regression equations, in predicting V from 


K it is convenient to work with the equation in the second of the following | 
orms: 


V — ay + aU + a,U? + aU? 
= Aoko + A46 + Ages + Аё) 
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In this latter form, 


A= Р, heh 

At ae E g= A 010210) 

Ay тте ао - o - E ah 

Ay = T2 &- alu Uy —(U — 0) El 7. 


The numerical values of the 2/8 depend upon the value of К, the number of 
treatments. Numerical values for 4; will be found in tables of the coeffi- 
cients. 


3.8 Use of the Studentized Range Statistic 
Although the hypothesis r, = Ta <<!” = Tp İS generally tested by means 


ofan Fratio, there are other methods for testing this same hypothesis. One 
method is through use of the studentized range statistic, defined by 


hr Tiargest ік. Tsmaliest 


B A MSerror/ 


where n is the number of observations ineach 7. In words, ће studentized 
range statistic is the difference between the largest and smallest treatment 
means (the range of the treatment means) divided by the square root of the 
quantity mean-square experimentalerror over n. A numerically equivalent 


form of the q statistic is given by 


Tiargest — Tsmatlest 


(la ы 
\/пМ5етгог 


where а Т represents a sum of n observations. Computationally it is more 
convenient to compute first the following statistic: 


Ti est — Tsmantest)” 
(2) 13 Cargo aane 
Penis 2nMSerror 


Then q may readily be shown to be equal to 


(3) q = /2Frange- 


Under the hypothesis that т = 72 =` ` ` = Tr and under all the assump- 


tions underlying either model I or model II, the sampling distribution of the 
q Statistic is approximated by the studentized range distribution having 
parameters k = number of treatments and f = degrees of freedom for 
MSeror. The symbol 4 s (K. f) designates the 99th percentile point on the 
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q distribution. Tables of the latter are given in Table B.4. In contrast to 
the over-all F ratio, which uses all the k treatment means to obtain the 
numerator, the q statistic uses only the two most extreme means in the set of 
k. Thus, the q statistic uses less of the information from the experiment 
than does the F statistic. Use of the F statistic generally leads to a more 
powerful test with respect to a broader class of alternative hypotheses than 
does the use of theg statistic. However, there are some alternative hypoth- 
eses for which the q statistic leads to a more powerful test. Whether or not 
the two tests lead to the same decision with respect to the hypothesis being 
tested depends upon the distributions in the populations from which the 


Table 3.8-1 Numerical Example 


Treatments...) а b c d e 
PEEN 10 14 18 14 14 k=5;n=4 
Pahora d a - | 2.50 | 3.50 | 4.50 | 3.50 | 3.50 
Source 55 df MS Е 
Treatments 8.00 4 | 2.00 “4.00 
Experiment error 7.50] 15 | 0.50 
Total 15.50 57 


= лт „ШИЛ лн ы. a) ә. 
F (4,15) = 4.89 


means were obtained. If the distributions are each М№Ми,о), the use of either 
the F or the q statistic will lead to the same decision. In other cases the 
decisions reached may differ. 

The numerical example summarized in Table 3.8-1 provides an illustration 
of a case in which the two tests lead to different decisions. Each Т, in this 
table represents the sum over 4 observations (n = 4) under each of 5 treat- 
ments (k = 5). Since the observed over-all F ratio (4.00) does not exceed 
the critical value (4.89) for a .01-level test, the data do not contradict the 
hypothesis that the treatment effects are all equal. The 4 statistic as com- 
puted from (1) is 

4.50 — 2.50 
ЕТІ = 5.67. 


The critical value for a .01-level test is 45,15) = 5.56. Since the ob- 
served q statistic exceeds the critical value, the data contradict the hypothesis 
that the treatment effects are all equal. In this case the F statistic and the 
q statistic lead to conflicting decisions. This is so because the means for 
treatments 6, d, and е fall at a single point. 
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In contrast to the example in Table 3.8-1, consider the example in Table 
3.82. In this latter table the means tend to be more evenly distributed 
between the highest and lowest values. $$, for this case is larger than 
it is for the case in which the means concentrate at a single point within the 
range. The experimental error in this example is numerically equal to that 
in Table 3.8-1. The over-all F value exceeds the critical value for a .01- 
level test. Since the ranges are the same in the two examples and since the 
error is also the same, there is no numerical change in the value of the g 
Statistic. If, in practice, the over-all F leads to nonrejection of the hy- 
pothesis being tested but the q test leads to rejection of the hypothesis being 


Table 3.8-2 Numerical Example 


Treatments . . А а b c d e 
DOW | 10 | 12 | 18 | 16 | 14 k=5;n=4 
Тт ИЙ | 2.50 | 300 | 450 | 400 | 3.50 
Source SS df | MS F 
‘Treatments 10.00 4 2.50 | 500** 
Experimental error 7.50| 15 | 0.50 
Total 17.50 Ед 


** F (4,15) = 4.89 


tested, the experimenter should examine his data quite carefully before 
attempting any interpretation. In most cases, the F test is more powerful 


than the corresponding q test. А ; 

A numerical example will be used to illustrate the marked difference in 
sampling distributions for statistics constructed in a way which takes into 
account the order of magnitude of the observed outcomes. Consider a 


Population having the following five elements: 
Element x 


ч шоок, 
t2 
о 


The collection of all possible samples of sizen = 3, the sequence within the 
sample being disregarded, that can be constructed by sampling without 
replacement after each draw is enumerated in part i of Table 3.8-3. 

The statistic d; is defined to be the difference between the largest and the 
smallest of the sample values. The statistic d, is the difference between the 
next largest and the smallest. Under a sampling plan which gives each of 
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the samples in part i an equal chance of being drawn, the sampling distribu- 
tions for d, and d, are the cumulative relative-frequency distributions 
in part ii, From the latter, one obtains the following probability state- 


p P(d, > 10) = .10, 
P(d, > 10) = .70. 
In words, the probability of large values for d, is larger than the corre- 
sponding probability for а. 
Table 3.8-3 Sampling Distributions for Ordered Differences 


Sample а, а, Sample | а, а, 
10, 15, 20 5 10 10; 25,:30-|--15-| 20 
10, 15, 25 5 15 15, 20, 25 | 22 10 
(i) 10, 15, 30 5 20 15, 20, 30 5 15 
10, 20, 25 10 15 15, 25, 30 10 15 
10, 20, 30 10 20 20, 25, 30 5 10 
Cumulative Cumulative 
4 rel. frequency dg rel. frequency 
(ii) 20 1.00 20111 1.00 
151 1.00 151111 70 
10111 .90 10111 .30 
5ІШІ 60 5 .00 


The principle that emerges is this: When one attempts to draw inferences 
from statistics determined by taking into account the order of magnitude 
of the observations within an experiment, the appropriate sampling dis- 
tribution depends upon the number of ordered steps between the observa- 
tions from which the statistic was computed. 

A modified q statistic is particularly useful in probing the nature of the 
differences between treatment means following a significant over-all F. 
The procedure to be described is known as the Newman-Keuls method. 
For this purpose, the q, statistic will be used, where r is the number of steps 
two means (or totals) are apart on an ordered scale. Consider the following 
treatment totals arranged in increasing order of magnitude: 


Order...) 1 2 3 4 5 6 1 


| Тау | Тә, | Тә | Tw | Ti | Ту | Ті 

In this notation 7, designates the smallest treatment total and Те the 
largest treatment total. T is defined as being seven steps from Tu» 
T'e) is six steps from Ta), etc. ; Ті) is six steps from 7, Т); is five steps from 
Ti, еіс. In general, the number of steps between the ordered totals Tin 
andT(;isj — i--1. Thus, 

_ Та = Тау. 


4; ; 
V"MSerror 
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there is only опе 4; statistic. There are, however, two qe statistics, 


— Te — Tw 


qe = === and 4 у 
i у nMSerror A/nMSerror 


q corresponds to the ordinary studentized range statistic when k = 7. 
4» Where ғ is less than К, is a modified or truncated studentized range 
statistic. Critical values for q, are obtained from tables of the studentized 
range statistic by setting r equal to the range. For example, the critical 
value for qs in a .01-level test is 4 (6, f); the corresponding critical value for 
qa i$ q.o(4,f ), Where fis the degrees of freedom for MSerror. (Tables of the 
q Statistic are given in Table B.4.) 

Rather than working directly with the critical value for the q, statistic, 
it is more convenient, in making a large number of tests, to obtain a critical 
value for the difference between two totals. For example, 


То) — То) = 4у nMSerror. 


Since the critical value for gg is qı—a(6,f), the critical value for Ті) — То) 
willbe q,_,(6,f) VnMSerror. Іп general, the critical value for the difference 
between two treatment totals which are r steps apart on an ordered scale 
will be q,_,(,f) VnMSerror. l 

The use of the q, statistic in testing the difference between all pairs of 
means following a significant over-all F will be illustrated by the numerical 
example in Table 3.8-4. The summary of the analysis of variance appears 
in parti. Since the observed F ratio (9.40) exceeds the critical value for a 
01-level test, the hypothesis that the effects of the seven treatments are all 
equal is rejected. The seven treatments are designated by the symbols a 
through g. The totals of the five observations under each of the treatments 
are arranged in increasing order of magnitude in part ii of this table. For 
example, treatment с has the smallest total; treatment f has the largest total. 
A table of differences between the treatment totals also appears in part 1t. 
For example, the entry in column а, row с, iT, — Т, = 12 — 10:- 2; the 
entry in column g, row a, is T, — T, = 22 — 12 = 10. In general, the 
entry in column j, row i, is T, — Т. Only the entries shown in this table 
need be computed. Nw е 

The critical values for ће 4, statistic (when « = .01) are given in part iii 
of Table 3.8-4. Since mean square for experimental error has 28 degrees of 
freedom, f = 28. The entry 3.91 is obtained from tables of the studentized 
Tange statistic in which k = 2 and f = 28. The entry 4.48 is obtained from 


the same row of the tables, but from the column inwhichk = 3. Similarly 


the entry 4.84 is obtained from the column in which k =4. These entries 


are critical values forq,; to obtain the critical value for the difference between 
treatment totals which are r steps apart, the critical values are multiplied by 


A/nMSerror = 1/5(80) = 4.00 = 2.00. 
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Thus, the entry 7.82 = .o9(2,28)V NMSerror = (3.91)(2.00). Similarly the 
entry 10.90 = 4:(7,28)У/пМбего: = (5.45)(2.00). Hence the critical value 
for the difference between two treatment totals that are r = 2 steps apart 
is 7.82, whereas the critical value for the difference which is r = 7 steps 
apartis 10.90, Differences between totals an intermediate number of steps 
apart have critical values between these two limits. 

The farther apart two means (or totals) are on an ordered scale, the larger 


Table 3.8-4 Tests on All Ordered Pairs of Means 


(с —7, n=5) 
Source SS df | MS F 
Treatments 45.09 6 | 7.52 |9.40 
(i) Experimental error | 22.40) 28 .80 
Total 67.49 | 34 F 906,28) = 3.53 
Order Аа 1 2 3 4 5) | 6 Т) 
Treatments іп И 
order of 7;...... c a d b g e if 
Ih Ne close dd uo 10 12 13 18 22 24 29 
Gi) c a d b g e ji 
c — 2 8 12 14 15 
a — 1 6 10 12 13 
d — 5 9 11 12 
b = 4 6 7 
& — 2 3 
e — 1 
if E 
scita decr c eni duae pai co L 
Truncated ranger... 2 3 5 6 Т 
ЕК А 3.91 4.48 4.84 5.09 5.28 5.45 


4.ө(ғ,28) VnMSerror «| 7.82 8.96 9.68 10.18 | 10.56 | 10.90 


беч T Dc 


(iv) 


Svat C & R8 ^ 


USE OF THE STUDENTIZED RANGE STATISTIC 83 


the difference between them must be before this difference exceeds its critical 
value. Thus, if one examines the data obtained from an experiment and 
decides to test the difference between the largest and the smallest mean in а 
set of k means, the critical value for a test of this kind is larger than the critical 
value for two means which are adjacent to each other on an ordered scale. 
The larger critical value is required because the sampling distribution of the 
difference between the largest and smallest means in a set of k will have a 
greater relative frequency of more extreme values than will the sampling 
distribution of two adjacent means in a set of k. 

' There is a prescribed sequence in which tests on the differences between 
treatment totals in part ii must be made. 

1. The first test is on the difference in the upper right-hand corner. This 
difference is То — Ті) = T, — T, = 15; the totals here аге r = 7 steps 
apart, The critical value for a .01-level test is 10.90. Since the observed 
difference exceeds 10.90, the hypothesis that ту — т, is rejected. Two 
asterisks appear in cell (c, f^) in part iv to indicate that this hypothesis is 
rejected. If this hypothesis is not rejected, no additional tests are made. 

2. Tests are now made successively on the entries in row c, proceeding 
from right to left until the first nonsignificant difference is found. The test 
on the difference Т) — Ta) = T, — T, = 14 has the critical value 10.56, 
since these totals are ғ = 6 steps apart. The test on the difference 


Tu, — Та = T, - Te = 12 


has the critical yalue 10.18, since these totals are r = 5 steps apart. The 
difference T, — Ta) = T, — T, = 8 has the critical value 9.68, since these 
totals arer = 4stepsapart. This last difference does not exceed its critical 
value. Hence no additional tests are made in row с. Since the last 
Significant entry occurs in column g, no additional tests made will go beyond 
column g. | 

3. The next test is made on the extreme right-hand entry in row а. This 
entry is T, — Tu, = T, — T, = 12. Because these totals arer = 6 steps 
apart, the critical value is 10.56. Thus, the hypothesis that т, = 7415 re- 
jected. If this hypothesis had not been rejected, no additional tests would 
be made. , í 

4. Tests are continued from right to left in row а either until a non- 
Significant entry is obtained or until column g is reached, whichever 2 
first. The entry Те — Tie) = T, — Ta = 12 has the critical value 194 6 
Since these totals аге r = 5stepsapart. The entry То Та = T, — Ta = 
has the critical value 9.68, since these totals are r = 4 steps apart. Since 
No tests are made beyond column g, no additional tests are made in 
Tow a. TBN 

5. Tests now proceed from right to left in row d. Tests in this ү 
Continue until either a nonsignificant entry 1s reached or the point at ы Ss 
tests were stopped in the preceding row is reached, whichever occurs first. 
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The first entry at the right in row dis T; — Та) = T, — Ta = 12; this entry 
has the critical value 10.18. The next entry in row d, 


La А a =; 


has the critical value 9.68, since the totals аге г = 4 steps apart. The next 
entry, Ti, — Ti = T, — T, = 9, has the critical value 8.96, since these totals 
arer = 3 steps apart. Since no tests are made beyond column g, this is the 
last test made in row d. 

6. Tests now proceed from right to left in row b until either a nonsignifi- 
cant entry occurs or the column at which tests were terminated in the pre- 
ceding row is reached, whichever occurs first. The first entry on the right 
in row b, T, — Tu, = T, — Т, = 7, has a critical value of 9.68, since the 
totalsarer = 4stepsapart. Thehypothesis that, = v, cannot be rejected. 
No additional tests are made in row b or in any row below it. 

The differences which proved to be significant at the .01 level are indicated 
by asterisks in part iv of Table 3.8-3. The information in part iv may be 
summarized schematically as follows: 


cadbgef. 


Treatments underlined by a common line do not differ from each other; 
treatments not underlined by a common line do differ. Thus, treatment f 
differs from treatments c, a, and d, but treatment f does not differ from 
treatments e, g, and b. Similarly treatment e differs from c, a, and d but 
does not differ from b, g, f. 

This sequence for making the tests prevents one from arriving at contra- 
dictory decisions of the following type: Suppose there are four means in 
an ordered set and that the different 7,4, — Ty) is close to being significant but 
does not quite “make it.” Further suppose that the difference Ti) — Tw 
is just larger than the appropriate critical value. In this case one might 
be tempted to conclude that there is no significant difference between the 
largest and the smallest means in the set but that there is a significant 
difference between the next to the largest and the smallest. Geometrically 
this conclusion would be equivalent to inferring that the distance between 
the largest and the smallest of four means is zero but that the distance from 
the smallest to the next to the largest is greater than zero. Yet the latter 
distance has to be smaller than the former. Clearly this kind of inference 
leads to a contradiction. 

In general, if the sequence indicated above is followed in making tests on 


all possible pairs of ordered means, the patterns of significant differences 
indicated below will not occur: 


@) Q (3) (4) (5) (D 0) @) @ (9 
— ЕНІ ырза е) 
5 ж' ^+» 
an & les 
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That is, between any two asterisks in the same row or column there can be 
no gaps (nonsignificant differences). Further, if the extreme position at the 
right of a row is a gap, then there can be no asterisks in that row or any row 
below that row. 

It is of interest to compare the critical values for tests on ordered means 
with tests on individual components made through use of the F statistic. 
Since q = V2F, the critical value for a test on individual components Гог 
the example in Table 3.8-3 would be (in comparable units) 


УЭЕ 441,28) = V2(7.64) = 3.91. 


This is the critical value for totals which are two steps apart. Hence use of 
the F statistic is equivalent to using the 3.91 as the critical value for all q’s, 
or using the value 7.82 as the critical value for differences between treatment 
totals. The critical values for the Newman-Keuls procedure range from 
7.82 to 10.90. Hence the use of the F statistic leads to more significant 
results than does the use of the q, statistic. 

If the meaningful comparisons are relatively few in number and are 
planned before the data are obtained, the F test associated with individual 
components of variation should be used. This type of comparison is called 
ana priori comparison in contrast to comparisons made after inspection of 
the experimental data; the latter are called a posteriori or post-mortem 
comparisons. Thea priori type is always justified whether or not the over- 
all F is significant. If the k treatments fall into one or more natural 
groupings in terms of the treatments, tests on ordered differences may be 
made separately within each of the groupings. Other procedures for 
making a posteriori comparisons are discussed in the next section. 

In contrast with the procedures to be described in the next section, the 
level of significance for the Newman-Keuls „procedure is considered 
individually with respect to each test. Tests in the next section have 4 
different orientation with respect to the level of significance. 


3.9 Alternative Procedures for Making A Posteriori Tests 


The Newman-Keuls procedure, described in the last section, keeps the 
level of significance equal to « for all ordered pairs, no matter c dd 
Steps apart the means may be. However, the level of significance wit 
respect to the collection of all tests made, considered as à single test, is 
considerably lower than х. Thus, the power of the collection of all tests 
made is less than that associated with an ordinary a-level test. Duncan 
(1955) has developed a procedure which uses a protection level of « for the 
collection of tests, rather than an « level for the individual tests. ni 

[Scheffé (1959, р. 78) takes issue with the principles d the 
development of the sampling distributions which Duncan uses in obtaining 


the critical values for tests.] 
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In terms of individual tests, a protection level with x = .01 provides a 
level of significance equal to .01 for means which differ by two ordered 
steps; for means differing by three steps, the level of significance is equal to 
1 — (.99)# = .02; for means differing by four steps, the level of significance 
is equal to 1 — (.99)? = .03. Thus, with protection level equal to х, the 
level of significance for individual tests is numerically higher than « when 
the pairs are more than two steps apart. 

The statistic used in the Duncan procedure is the same as that used in 
the Newman-Keuls test, namely, g,, where r is the number of steps apart two 
means or totals are in an ordered sequence. The steps followed in using 
the Duncan procedure are identical to those followed in the Newman-Keuls 
procedure. However, the critical values for the q, Statistic are obtained 
from tables prepared by Duncan for use with protection levels. 

For the case k = 7 and degrees of freedom for experimental error equal 
to 28, critical values for the .01-level Newman-Keuls tests and the .01-level 
protection level are as follows: 


ҚАМЫ ORTU | 2 | 3 4 | 5 6 7 
Newman-Keuls... | 3.91 | 448 | 4.84 | 509 | 528 545 
Duncan «$n vs 3.91 | 4.08 | 4.18 | 4.28 | 4.34 | 4.39 


When k = 2, the two procedures have identical critical values, For values 
of k larger than 2, the Duncan procedure has the smaller critical value. The 
larger the value of r, the larger the difference between the critical values for 
the two procedures. Thus, on the average, a larger difference between two 
means (or two totals) is required for statistical significance under the 
Newman-Keuls procedure. 

To illustrate the difference in size of the type 1 error associated with 
individual tests, which disregards the order aspect, the q, statistic may be 
transformed into an F statistic by means of the relation 


4, = J2F or к= 2 Я 
For the case discussed in the preceding paragraph, 5.45 is equivalent to an 
F of 14.85, and 4.39 is equivalent to an F of 8.63. Relative to the critical 
value for an individual comparison made by means of an F statistic, the 
type 1 error associated with the difference between two totals which are 
Seven steps apart is as follows: 


RS RE ee Г ИШИ 


Critical F value | “Actual” « 


Individual comparison 7.64 .01 
Newman-Keuls 14.85 .0005 
Duncan 8.63 007 
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In other words, if a .01-level test were made on the difference between two 
means that are seven steps apart on an ordered scale, assuming that М5 
has 28 degrees of freedom, and if the order of the means in the sample were 
disregarded, the critical value would be F = 7.64. If, however, the order 
were taken into account, the equivalent critical value would be 14.85. With 
a .01 protection level, the equivalent critical value would be 8.63. 

A considerably more conservative procedure in terms of keeping the type 
1 error small is the use of q, _ ,(k,f) as the critical value for all tests, no matter 
how many steps apart the means may be. Thus, instead of changing the 
critical value as a function of the number of steps two means are apart on 
an ordered scale, the critical value for the maximum number of steps is used 
for all tests. This approach, suggested by Fisher, has been studied and 
extended by Tukey and will be called the Tukey (a) procedure. [This pro- 
cedure has also been called the honestly significant difference (hsd) pro- 
cedure.] Compared with the Newman-Keuls and Duncan approaches, 
fewer significant differences will be obtained. For the example considered 
in Table 3.8-4, the critical value for the q, statistic would be 5.45 for all tests; 
equivalently, the critical value for the difference between two treatment 
totals would be 10.90 for all differences. If the Tukey (a) test were used 
with the data in Table 3.8-4, part iv of this table would have the following 
form. 

c a d b g e f 


c жж жж жж 
а жж 
а жж жж 


The power of Tukey (а) tests is lower than those of the Newman-Keuls and 
the Duncan procedures. Use of the Tukey (a) procedure is not necessarily 
limited to tests on differences between pairs of means; it may also be used 
in making comparisons involving three or more means. The Tukey (a) 
procedure has this general property: all tests on differences between pairs 
have a level of significance which is at most equal toa. — г 

Tukey has also proposed a second procedure, which is a compromise 
between the Tukey (a) tests and the Newman-Keuls procedure. The statis- 
tic employed in making what will be called Tukey (0) tests is again the ГА 
statistic. However, the critical value is the average for the corresponding 
value in the Newman-Keuls tests and the critical value for the Tukey (а) 
tests. In symbols, this critical value is 


qi. f) + qi-rsf) И 
2 


where k is the number of means in the set and r is the number of ia 
between the two means being compared. For example, if k — 7 an 
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/ = 28, the critical value for a .01-level test for two means which are four 
steps apart is 
5.45 + 4.84 
2 


= 5.14. 


A procedure for making all possible comparisons, not specifically com- 
parisons involving two means, has been developed by Scheffé. An F 
Statistic corresponding to a component of variation is computed, but the 
critical value for this component is (k — ПЕ (К —1,f). АП tests use 
this critical value. For example, if k = 7, = 28, and х = .01, the critical 
value for Scheffé tests is 6[F,,(6,28)] = 6(3.53) = 21.18. Іп terms of 
the q statistic this is equivalent to a critical value of 6.51. If Scheffé 
tests were made on the data in Table 3.8-4, the critical value would be 
(6.51) V nMS,,,,, = 13.02 for all differences between treatment totals. The 
statistically significant differences using this critical value are as follows. 


с а а b g e Ji 


c жж жж 


Thus, the Scheffé method applied to testing differences between all possible 
pairs is even more conservative with respect to type | errors than is the 
Tukey (a) method. 

The Scheffé approach has this optimum property: the type | error is at 
most « for any of the possible comparisons, Іп the origina! development, 
Scheffé was concerned with constructing a set of simultaneous confidence 
intervals on all the comparisons within a subdivision of the entire experi- 
ment. Before a set of simultaneous confidence intervals is considered to 
be true, each of the separate statements must be true. If any one of the 
confidence intervals in the set is false, then the confidence statement is 
considered to be false. The simultaneous-confidence-interval approach 
can be translated into a procedure for making all possible tests on com- 
parisons. This translation has been outlined above. 

The data in Table 3.8-4 will be used to summarize the various methods 
for making a posteriori tests. For == 01, the critical values for 
differences between pairs of ordered totals are as follows: 


reu MNT a __ 


Method 2] 3 4 5 6 7 
Scheffé 13.02 13.02 1302 13.02 1302 13.02 
Тикеу (а) 100 10.0 10.90 10.90 10.90 10.90 
Tukey (0) 936 993 1029 1054 1074 10.90 
Newman-Keuls 782 8.96 9.68 1018 10.56 10.90 
Duncan 7.82 816 836 856 868 8.78 
Individual comparisons 7.82 7.82 7.82 7.82 7.82 7.82 
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The last method is primarily for meaningful comparisons planned prior to 
inspection of the data; it is included in this summary only for purposes of 
illustration. The Scheffé method is clearly the most conservative with 
respect to type 1 error; this method will lead to the smallest number of 
significant differences. In making tests on differences between all possible 
pairs of means it will yield too few significant results. The Tukey (a) test 
will also tend to yield too few significant results. Because the Tukey (a) 
test is applicable in a relatively broad class of situations, and because it is 
simple to apply, there is much to recommend the Tukey (a) test for general 
use in making a posteriori tests. 


3.10 Comparing All Means with a Control 


If one of the k treatments in an experiment represents a control condition, 
the experimenter is generally interested in comparing each treatment with 
the control condition, regardless of the outcome of the over-all F. There 
are k — | comparisons of this kind. Rather than setting a level of signif- 
icance equal to ж for each of the tests, the experimenter may want to have a 
level equal to х for the collection of the k — 1 decisions, considered as a 
single decision summarizing the outcomes. $ 

Since each of the tests uses the same information on the control condition 
and a common estimate of experimental error, the tests are not independent. 
Dunnett (1955) has derived the sampling distribution for a t statistic appro- 
priate for use when level of significance х is desired for the set of all com- 
parisons between several treatments and a control. The parameters of 
Dunnett's distribution for the / statistic are: 


k = number of treatments (including the control), 
df = degrees of freedom for MSerror- 
The approach used by Dunnett is a special case of the problem of handling 
multiple comparisons by constructing a joint confidence interval on the set 
of all relevant comparisons. 
The numerical example in 
application of Dunnett’s t statistic. 


Table 3.10-1 will be used to illustrate the 
In this experiment there are four treat- 
ment conditions (k = 4); three observations (n — 3) are made under each 
of the treatment conditions. One of the treatments represents a standard 
manufacturing process; the other three represent different CE A 
manufacturing the same product. The criterion measure In case is s 
index of quality of the manufactured product. In this case the oer 
F = 7.05 exceeds the critical value for a 05-level test. The / statistic for 
the difference between method / and the standard method is 


mon A 
— J2MSerror/ 


The critical value for the collection of k — 1 statistics of this form that may 
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be computed is obtained from the Dunnett tables given in Table B.6. For 
the data in Table 3.10-1 the critical value for a two-tailed .05-level test is 
+2.94 [that is, f.975(8) = 2.94]. This value is found under the column 


Table 3.10-1 Numerical Example 


Methods 
Standard I II ІП 

55 55 55 50 

47 64 49 44 | n=3;k=4 

48 64 52 41 
T; 150 183 156 135 | G = 624 
У(Х?) 7538 11,217 | 8130 6117 | (ХХ?) = 33,002 
T; 50 61 52 45 


AN жен rE IB e uos ЕСТЕ ПОЛ | 

(1) = G?/kn = (624)2/12 = 32,448.00 

(2) = ХХХ?) = 33,002 

(3) -(“Т)/п = (150° + 183° + 156° + 135%)/3 = 32,850.00 
SSmethoas = (3) — (1) = 402.00 
SSerror Lx Q) = (3) = 152.00 
SStotar = (2) — (1) = 554.00 

E— se гил mse Neen е 


Source of variation SS df MS T 
Methods 402.00 3 134.00 | 7.05 
Experimental error 152.00 8 19.00 

Total 554.00 11 


Ea Шаа). 
Е.5(3,8) = 4.07 


headed 4 and the row corresponding to degrees of freedom equal (о 8. For 
example, in comparing method I with the standard, 

61 — 50 11 

t=- = —— = 309. 

\/219)/3 3.56 
Since the observed / statistic exceeds the critical value, the hypothesis that 
the two methods of manufacturing yield products having equal average 
quality index 15 rejected. The level of significance for this single test is not 
.05; it is approximately .02. The critical value of 2.94 is associated with 
the collection of the three tests that would be made. The other two tests are 


52— 50 

= = .56, 
3.56 
ДЫК 

= шн 
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These three tests may be summarized in the statement that method I differs 
significantly from the standard but methods IL and Ш do not. This sum- 
mary decision has significance level equal to .05. In the long run the 
summary decision reached by the procedures followed will have the 
equivalent of a type 1 error equal to, at most, .05, 

Rather than working with a series of tests of significance, the experimenter 
may construct a series of confidence intervals. For this purpose it is con- 
venient to introduce the concept of what Tukey has called an allowance, 
designated by the symbol A. In this context an allowance is defined by 


Aaa MSE 


where 1, („эу is a value obtained from the Dunnett tables. For the data 
in Table 3.10-1, with « = .05, A = 2.94(3.56) = 10.5. Тһе general form 
for the lower and upper confidence limits is 


(Те: To) + А. 


For the data in Table 3.10-1, 95 per cent confidence limits for the collection 
of confidence statements on the difference и, — Ko are as follows: 


For method I: (61 — 50) + 10.5 = 5 and 21.5. 
For method II: (52— 50) + 10.5 = —85 and 12.5. 
For method III: (45 — 50) + 10.5 = —15.5 and 5.5. 


The joint (simultaneous) confidence coefficient for the intervals defined by 
thesé limits is .95, The confidence interval for method I takes the form 


5<ш-іш € 21.5. 


In words, the difference between the mean quality indices for method Land 
the standard method is between .5- and 21.5-quality index units. This 
statement and the additional confidence statements for methods П апа Ш 
have а joint confidence coefficient of .95. . 

It is of interest to compare the distribution o tic v 
that of Student's ¢ statistic. For the case in which k = 2, the two distribu- 
tions are identical. For k greater than 2; corresponding critical on in 
the Dunnett tables are larger. For example, with the degrees of free: ex 
for error equal to 10 and k — 7, two-tailed tests with joint significance leve 
05 are equivalent to individual two-tailed tests at the .01 level. It is 2 
of interest to compare the critical values for this type of comparison ae 
those made by means of the Tukey (a) test. The two critical values "d e 
cast in terms of comparable units of measurement by means of the relation- 


ship 
а= 2. 


of Dunnett’s £ statistic with 
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Corresponding critical values for .05 joint significance levels are as follows: 


df for error k Tukey (a) Dunnett 
| y@ | 


10 8 5.30 4.68 
20 6 4.45 3.97 
© 10 447 3.89 
10 2 3.15 3.15 


For values of k greater than 2, the Tukey (a) values are higher than the 
corresponding Dunnett values. Since the Tukey (a) test was designed to be 
appropriate for all possible differences, of which comparisons of each 
treatment with the control are a subset, it is reasonable to expect that the 
Tukey (a) tests would require the larger critical values. 


3.11 Tests for Homogeneity of Variance 


One of the basic assumptions underlying both models I and II is that the 
variance due to experimental error within each of the treatment populations 
be homogeneous, that is, о = о сё. Moderate departures 
from this assumption do not, however, seriously affect the sampling dis- 
tribution of the resulting F statistic. That is, when the variances in the 
population are not equal, the F statistic using a pooled variance has approxi- 
mately the same distribution as the F statistic which takes the differences in 
the population variances into account. The following examples, taken 
from Box (1954, p- 299), illustrate the effect of lack of homogeneity of 
variance. In these examples n = 5, and k = 3. 


———— ÁÀ— s 


Populations oF 
Probability of F 
1 2 3 exceeding F. 95 
(a) Variances 1 1 1 .050 
(b) Variances 1 2 3 .058 
(c) Variances 1 1 3 .059 


Т Se EE eee 
In (a) all variances are equal; hence an F statistic which pools variances 
has probability of .05 of exceeding Fs; when тү = т, = та. In (b) the 
variances have the ratio 1:2:3; that is, the variance for the second popula- 
tion is twice the variance of the first population, and the variance of the 
third population is three times the variance of the first population. For 
this case, the exact sampling distribution (assuming т) = т; = тз) for the F 
statistic shows probability equal to .058 of exceeding Fy, obtained from 
the F statistic which assumes о = 0 = 02. Using the ordinary F test 
when the population variances are in the ratio 1:2:3 gives a test having a 
small positive bias, since relatively more significant results will be obtained 
than the exact sampling distribution warrants. In (c) the ratio of the 
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variances is 1:1:3; the F test in this case would also have a small positive 
bias, since the probability of exceeding F з is .059 rather than .050. When 
the number of observations in each treatment class varies considerably, Box 
indicates that the bias becomes somewhat larger and the direction of the 
bias is not always positive. Also, the greater the skewness in the distribu- 
tion of the population variances, the more bias in the resulting tests. 

In cases where the experimenter has no knowledge about the effect of the 
treatments upon the variance, tests for homogeneity of variances may be 
appropriate as preliminary tests on the model underlying the analysis. 
There is no need, however, for a high degree of sensitivity in such tests, 
because F tests are robust with respect to departures from homogeneity of 


Table 3.11-1 Numerical Example 


(n = 10) 
oO RBBB ee LU ойе 
Treatment 1 Treatment 2 Treatment 3 Treatment 4 
у | TE 95 83 220 
хх) 2320 1802. 869 6640 
Т?п 1960.00 902.50 688.90 4840.00 
SS; 360.00 899.50 | 180.10 | 1800.00 
sj 40.00 99.94 20.01 | 200.00 


| | ESS; = 3239.60 
| Zs? = 359.95 
___ НИИ елемес ма Ст. 
variance. The experimenter need be concerned about only relatively large 
departures from the hypothesis of equal population variances. 
A relatively simple, but adequate, test of the hypothesis that 


=: = 0 


[35 


оў = о; 


hen nis constant for all the k treatments 


nae bape ans of the statistic 


in an experiment, this hypothesis may be tested by те 


Р, largest of k treatment variances 
max smallest of k treatment variances 


„2 
__ Slargest 


SSmallest 

Under the hypothesis that of = о ==... = o the sampling Шор 
Of the Fmax statistic (assuming independent random samples from norma 

Populations) has been tabulated by Hartley. This distribution is given m 
Table B.7. The parameters for this distribution are k, the number o 

treatments, andn — 1, the degrees of freedom for each of the treatment class 
Variances. If the observed Fmax is greater than the tabled value associated 


With an a-level test, then the hypothesis of homogeneity of variance is re- 


jected. This test will be illustrated by use of the data in Table 3.11-1. 
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In this table k = 4, and п = 10. SS, is the variation within treatment 
class j and is given by 
2 
OSEE Me dw 
n 


The variance within treatment class j is given by 


The largest of the within-class variances is 200.00; the smallest is 20.01. 
Thus, the numerical value of the Fmax statistic for these data is 


200.00 
Fmax E 2001. — 10.0. 


From the tables of the Fmax distribution, Fmax „(4,9) = 9.9. Since the 
observed value of the Fmax statistic is greater than the critical value for a 
-01-level test, the hypothesis of homogeneity of variance is rejected. This 
test, referred to as Hartley’s test, is in practice sufficiently sensitive for use 
as a preliminary test in situations where such a test is in order. When the 
number of observations in each of the treatment classes is not constant, but 
the л/$ are relatively close to being equal, the largest of the sample sizes may 
be used instead of n in obtaining the degrees of freedom required for use in 
the Hartley tables, This procedure leads to a slight positive bias in the test, 
i.e., rejecting H, more frequently than should be the case. 

Another relatively simple test for homogeneity of variance, developed by 
Cochran, uses the statistic 


Slargest 
C= 2806 

ds? 
The parameters of the sampling distribution of this statistic are К, the 
number of treatments, and n — 1, the degrees of freedom for each of the 
variances, Tables of the 95th and 99th percentile point of the distribution 
of the C statistic are given in Table B.8. For the data in Table 3.11-1, 


For a .01-level test the critical value is C.99(4,9) = .57. The observed value 
of Cis quite close to the critical value but does not exceed it. However, the 
experimenter should on the basis of this result seriously question the ten- 
ability of the hypothesis of homogeneity of variance. 

In most situations encountered in practice, the Cochran and Hartley tests 
will lead to the same decisions. Since the Cochran test uses more of the 
information in the sample data, it is generally somewhat more sensitive than 
is the Hartley test. In cases where n,, the number of observations in each 
treatment class, is not constant but is relatively close, the largest of the 7/5 
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may be used in place of л in determining the degrees of freedom needed to 
enter the tables. 

Bartlett’s test for homogeneity of variance is perhaps the most widely 
used test. The routine use of Bartlett’s test as a preliminary test on the 
model underlying the analysis of variance is not, however, recommended. 
Only in relatively few cases is Bartlett’s test useful. From the computa- 
tional point of view it is more complex than is either the Hartley test or the 
Cochran test. In Bartlett’s test the п/віп еасһ of the treatment classes need 
not be equal; however, no n; should be smaller than 3, and most n,’s should 
be larger than 5. The statistic used in Bartlett's test is 


x = А38 (f log MSerror — Уу, log 52), 
c 
where f, — n; — 1 — degrees of freedom for BT 
f-f, = degrees of freedom for MSerror, 
Jf Sip 2. 
c= + {z} -4), 
Зе ОМ 
SS; 
MSerror ri zm . 
27, 
When c? = 02 = ++ = оў, the sampling distribution of the x? statistic is 


approximated by the 7? distribution having k — 1 degrees of freedom. 

The data in Table 3.11-1 will be used to illustrate the computation of the 
x? statistic. Computations will be indicated for the case in which the пу 
are not assumed to be equal. For this case MS,,,,, is most readily obtained 
from 
ESS, _ 3239.60 4999. 
xf, 36 
Since f, — 9 +9 +9 +9 = 36. Other items required for the computa- 
tion of the 7? statistic are 

flog MSerror = 36 log 89.99 = 36(1.954) = 70.344 


MSerror = 


fr log 5? = 9log 40.00 = 9(1.602) = 14.418 
flogs? = 9log 99.94 = 9(1.999) = 17.991 
fylogs? = 9log 20.01 = 9(1.301) = 11.709 
fr log s; = 9 log 200.00 = 9(2.301) = 20.709 
Xf, log sj | 64827 
hu ели 1 1-2) 
а= Е Е Зе 


1 4- $86 
— 1.046. 


Il 
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From these terms, the 7? statistic in Bartlett’s test is 


уш 2202 (70.344 64.827) = 12.14. 


1.0 
The larger the variation between the 575, the larger will be the value of the 
x? statistic. Fora .01-level test of the hypothesis that o? = 05 = --- = o? 


the critical value is 7%(3) = 11.3. Since the observed value of the 7? 
statistic is larger than the critical value, the experimental data do not support 
the hypothesis being tested. 

For the data in Table 3.11-1, the Hartley, Cochran, and Bartlett tests give 
comparable results. The Hartley test uses what is equivalent to the range 
of the sample variances as a measure of heterogeneity, whereas the Bartlett 
test uses what is equivalent to the ratio of the arithmetic mean to the geo- 
metric mean of the variances. The sampling distribution of the latter 
measure has a smaller standard error and hence provides a more powerful 
test of the hypothesis being tested. For purposes of detecting large depar- 
tures from the hypothesis of homogeneity of variance, either the Hartley or 
the Cochran test is adequate in most cases occurring in practice. 

There is some evidence that all the tests for homogeneity of variance that 
have been discussed above are oversensitive to departures from normality 
of the distributions of the basic observations. Bartlett and Kendall have 
proposed a test for homogeneity of variance which is less sensitive to non- 
normality than any of the above tests. А detailed description and applica- 
tions of the Bartlett and Kendall test will be found in the work of Odeh 
and Olds (1959). 


3.12 Unequal Sample Sizes 


The plan of an experiment may call for an equal number of observations 
under each treatment, but the completed experiment may not meet this 
objective. For comparable precision in the evaluation of each treatment 
effect this objective is a highly desirable one, assuming that the variances 
for the treatment classes are equal, Circumstances not related to the experi- 
mental treatments often prevent the experimenter from having an equal 
number of observations under each treatment. For example, in animal 
research deaths may occur from causes in no way related to the experimental 
treatments. In areas of research in which people are the subjects, it may 
be that only intact groups can be handled; such intact groups may vary in 
size. 

In the earlier sections of this chapter it is generally assumed that random 
samples of size n were assigned at random to each of the treatments. In 
this section it will be assumed that a random sample of size л, is assigned 
to treatment І, a random sample of size п, is assigned to treatment 2, etc. 
The size of the random sample is not assumed to be constant for all treat- 
ments. The form of the definitions of the sums of squares is different from 
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those appropriate for the case in which the sample size is constant. The 
notation that will be used is outlined in Table 3.12-1. The number of 
treatments in the experiment is k. The treatments are designated by the 
symbols 1, 2,..., /,..., Ё, where the symbol j represents any treatment 


Table 3.12-1 Notation 


Treatment 1 | Treatment 2|---| Treatment j | > ++ | Treatment К 

Number of 

Observations т т п; e N = Xn; 
Sum of 

observations Ti Т, Т, Т, G — XT, 
Mean of © У = =; = 

observations Т, Т, Т) Т, G = GIN 
Sum of squares of 

observations xxi xxi Ух} xx; ŒX} 
Тіп, Tin Т/л Тіп, Tiny 
Within-class 

variation SS, SS, “аз SS; Jn SS, 
Within-class aS SS, ge SS, qe LAUR = SSy 

variance "ml hm TTE n 1 
l L M ЕсСЗБЕаЈЕБЦЕЕЕЕЕЦЕЕЕЕЕ ЕСЕ 


within the set. The size of the sample observed under treatment jis desig- 
nated by the symbol п;. The total number of elements in the experiment is 


ny +n te п = М. 
Computational formulas are summarized in Table 3.12-2. Symbols (1) 


Table 3.12-2 Computational Formulas 


G = XX (8) = :(1) 
0 = @ = 30%} = 
NEN 0 — — 
SStreat = (3) — (1) або =k 1 
SSerror = (2) - (3) dferror = N — k 


SStota = (2) - (1) dftota =N — 1 


and (2) have the same general form as they do for the case of equal n’s. 
However, symbol (3) is different; that is, 

f i 

pus m Ng Ty 


Thus each Т? must be divided by its л, before the summation is made. The 


degrees of freedom for each of the sums of squares are also shown in Table 
3.122. For $$ the number of degrees of freedom is the pooled degrees 
of freedom for the variation within each of the treatments, 1.©., 
dferror = (лу — 1) + (п — D + 0% — 1) 
= № К. 
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єгєє = (1) – @) = "ss. 


ITEL = (£) — @) = "ss 


(ш) 
167662 = (1) — (£) = sg 
67196 = (и г)х = (£) SEOL = 2X55 = (0) 88124 = 97/(LED) = А/О = (D 
ee O EV ENO VS 18 ба. 
<=? = ә ЕР6 = <%<-ч 004 = 5 oot = Ч 
565 = is LOI = & 00's = i ooz = іх 
IT'EL = "585 lus = "ss 054 = 95 00702 = ‘ss 00°01 = 'ss 
647196 ІНЕ 6c s 1 > ^ * i e w 
ns %9-%; osor = gr 0050 = FF 00%5 = FF ! 
seot = (xxx 859 = EXT st = EX= 592 = {Хх = INS 
18179 99 = s=" ж-ч а= 
«=м 1-" 8 = Wu -ч 9 ш 
—————————M— d ILLAE 
П 
6 є 
01 z $ 
а А 9 1 
е z 01 £ (0) 
8 1 * 4 
а z 8 [4 
ot є L є 
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The computational formulas given in Table 3,12-2 become those for the case 
of equal sample sizes when n, is constant for all treatments, Hence the 
computational formulas for equal n’s are simplified special cases of those 
given in Table 3.12-2. 

The basic partition of the over-all variation now has the form 


(1) ХХХ, ai Gy -- ХХХ, т» Ty * NC, ~ бу, 
SSioti = SSerror + SStreat 


Thus in the computation of $$, each (T, = G)* is multiplied by л, 
Hence the variation due to the treatments is a weighted sum of the squared 
deviations of the treatment meansabout the grand mean; squared deviations 
based upon a large number of observations are given greater weight than 
those based upon a small number of observations, (А definition of SS,,,,, 
which assigns equal weight to the squared deviations is given at the end of 
this section, The latter definition is to be preferred in cases where 
differences in the 51's have no direct meaning in terms of the population 
about which inferences are being drawn.) 

A numerical example is given in Table 3.12-3, Data in part i represent 
the basic observations, Detailed summary data are given in part іі. The 
symbols used in part іі are defined in Table 3.12-1, For example, 


SS, = SX? — 1 = 64 — 5400 = 10.00. 
1 


As a rough check for homogeneity of variance, 


- Яшен „595, 5.56. 
Рам Raia 4 107 — 
The largest of the ns is 8. For a .05-level test, an а te critical 


value is Finax „(К = 4, df=8—1 eee ci og The data do not contra- 
dict the hypothesis of homogeneity of variance. 

Use of de computational wand is illustrated in part iii. There are 
alternative methods for computing these sums of squares, SS, may be 
Obtained from 

55етог = ESS, = 73.21. 
From the definition, 


SStroat = En (T, — С)? 
= 6(3.00 — 5.27)! + 5(7.00 — 527)! + $225 — 5.27 + 19.43 — 527) 
= 239,95, 

For purposes of showing just what it is that forms the sum of squares for 


crror and treatments, these alternative computational methods are more 
revealing, but they also involve more computational effort. 
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The analysis of variance is summarized in Table 3.12-4. A test on the 
hypothesis that all the treatment effects are equal is given by the F ratio, 


= MStreat _ 79.97 _ 24.02, 


MSerror 3.33 
The critical value for a .01-level test is F y9(3,22) = 4.82. Тһе data contra- 
dict the hypothesis of no differences in treatment effects. Inspection of the 
treatment means indicates that treatments 2 and 4 are quite different from 
treatments 1 and 3. 
A comparison has the same form as that for the case of equal n’s, that is, 


С = cT, + Т +: 4 — where Ec, = 0. 


F 


Table 3.12-4 Analysis of Variance 


Source of variation 55 df MS F 
Treatments 239.91 3 79.97 24.02** 
Experimental error 73.21 22 3.33 

Total 313.12 25 


—_—-—_— —_——_—_ 
**F (3,22) = 4.82 


A component of variation corresponding to a comparison is in this case 
С? 
(ст) + (сл) + +++ + (суп) 


Two comparisons, 


C, = а + en, +++: + с] 
) С» = Cay Ty + сот +. + са T,, 
are orthogonal if 
спс] ка Ciao» N 121321388 0 
m т п. 


These definitions reduce to those given for equal n’s when all the л, are 
equal. 


Computational procedures for comparisons will be illustrated through 
use of the data in Table 3.12-3. The component of variation corresponding 
to the difference between treatments 2 and 4 is 

(Т1 _ (7.00 — 9.43)? 
[G)*/nj] + [C-1)/n4] +++ 
To test the hypothesis that т; = 74, the statistic used is 

SS; _1720_ 
ММЗ 0313388: 


SSe = 17.20. 


F= 


SAG 
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The numerator of this statistic has 1 degree of freedom, the denominator 
22 degrees of freedom (the degrees of freedom for М8). For a .0I-level 
test the critical value is F,,(1,22) = 7.94. Since the observed F statistic 
does not exceed the critical value, the data do not contradict the hypothesis 
that т, = 7,. 

As another example, suppose that a meaningful comparison planned in 
advance of the experiment is that of treatment 4 with all others combined; 
i.e., does treatment 4 differ from the average of all other treatment effects? 
This comparison is given by 


(37, — Т, — Т — Т)? [3(9.43) — (3.00) — (7.00) — (2.25)? 
0 (ст) + (e/m) + (c/n) + (c/n) ERE ERE 
— 144.54. 
To test the hypothesis that т, = (т + Ta + 73)/3, the statistic used is 
эзсе ara. 


MSerror 3.33 
The critical value for a .01-level test is F.94(1,22) = 7.94. Hence the data 
clearly contradict the hypothesis that the effect of treatment 4 is equal to 
the average effect of the other three treatments. 

When the n;’s do not differ markedly, the Newman-Keuls, the Duncan, 
or either of the Tukey methods may be adapted for use in making tests on 
differences between all pairs of means. The Newman-Keuls method will 
be used to illustrate the principles involved. With unequal sample sizes it 
is convenient to work with the treatment means. (For the case of equal 
sample sizes it is more convenient to work with the treatment totals.) „The 
example in Table 3.12-3 will be used to illustrate the numerical operations. 
Part i of Table 3.12-5 gives the treatment means arranged in order of 
increasing magnitude. The differences between all possible pairs of means 
are shown. For example, the entry 7.18 in the first row is the difference 
9.43 — 2,25. The entry 4.75 is the difference 7.00 — 2.25. In general an 
entry in this table is the difference between the mean at the top of the column 
and the mean at the left of the row. Р 

The statistic to be used in making tests on these differences is q, 

EIS 
4 /MSerror/n ; 
where ғ is the number of steps the two means are apart on an ordered scale. 


The л in the expression V/MS,,,,,/n refers to the number of observations in 
each of the means and is assumed to be constant. If the л/8 do not differ 
markedly from each other, the harmonic mean of the njs may be used 
instead of n in this expression. Тһе harmonic mean ñ 15 defined as 

k 


"= ym) + тд 4-4 Cm)’ 
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For the numerical example, 


4 
ee SO 
bettie 

Since the degrees of freedom for MS,,,,, are 22, the critical values for the 
q, Statistic are found in the tables of the studentized range statistic in the 
row corresponding to 22 degrees of freedom. The critical values for a 
-01-level test are given in part ii of Table 3.12-5. Thus 3.99 is the critical 


ñ= 


Table 3.12-5 Tests on Differences between All Pairs of Means 


Treatments 3 1 2 4 


Means 2.25 3.00 7.00 9.43 


З 2.25 — 0.75 4.75 7.18 
(i) 1 3,00 — 4.00 6.43 
2 7.00 == 2.43 
4 9.43 — 
r= r=3 r=4 
(ii) Q.95(r,22) 3.99 4.59 4.96 
(iii) VMSerror/#G.99("22) 2.90 3.34 3.61 
3 1 2 4 
3 жж жж 
(іу) 1 жж жж 
4 


value for the 4, statistic when r = 2, that is, when the means are two steps 
apart; 4.59 is the critical value for qr Whenr = 3. In making several tests 
it is convenient to work with the critical value of the difference between a 
pair of means rather than the critical value of g,. Since 


MSerror/ft q, = Т, ж Т,, 
the critical value for the difference between two means is 


у MSerror/ ñ 4-а(ғ,4Ғ), 


The numerical value of VMS,,,0,/f is in this case V3.33/6.30 = .727. 
Hence the critical values for .01-level tests on the differences between pairs 
of means are given by multiplying the entries in part ii of Table 3.12-5 
by .727. These values are given in part iii. For example, the entry 
2.90 = (.727)(3.99). 

The sequence in which the tests must be made is given in бес. 3.8. This 
sequence must be followed here. 
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1. The first test made is on the difference 7.18 in the upper right of part 
i. Since this entry is the difference between two means that are four steps 
apart, the critical value is 3.61. Hence the hypothesis that ту = 7, is con- 
tradicted by the experimental data. 

2. The next test is on the entry 4.75, the difference between two means 
which are three steps apart. The critical value for this test is 3.34. Hence 
the data contradict the hypothesis that т; = тз. 

3. The entry .75, which is the difference between two means that are two 
steps apart, is tested next. The critical value is 2.90. Hence the data do 
not contradict the hypothesis that т) = 75. 

4. The entry 6.43 is tested against the critical value 3.34, since this entry 
is the difference between two means which are three steps apart. Hence the 
data contradict the hypothesis that т) = 74. 

5. The entry 4.00 is tested against the critical value 2.90. The data con- 
tradict the hypothesis that т) = тз. 

6. The entry 2.43 is tested against the critical value 2.90. The data do 
not contradict the hypothesis that т; = 74. 

A summary of the tests is given in part iv. The cells with asterisks 
indicate that the corresponding differences are statistically significant at the 
01 level. Schematically this summary may be represented as follows: 


3 1 270 94 


Treatments underlined by a common line do not differ; treatments not 
underlined by a common line do differ. Hence treatments 2 and 4 differ 
from treatments 3 and 1, but there is no difference between treatments 2 
and 4 and no difference between 3 and 1. 

In adapting the Duncan method or either of the Tukey methods to the 
case of unequal sample sizes, the harmonic mean й is used in place of n. 
For example, in the Tukey (a) test the critical value for .01-level tests on all 
differences would be УМ, „„./7 4.0004), where k is the number of treat- 
ments and df is the degrees of freedom for MS,,,. For the numerical 
example, this critical value is 3.65. In this case the Tukey (a) test would give 
outcomes identical with those obtained by means of the Newman-Keuls test. 

The definition of 55... given earlier in this section requires that each 
(T, — G)? be weighted by n;. If the error variances for each of the treat- 
ments are equal, this weighting procedure essentially weights each squared 
deviation by a factor proportional to its standard error, which is се]. Jf, 
on the other hand, the zs are in no way related to the hypothesis being 
tested and it is desired to give each treatment mean a numerically equal 
weight in determining 58.» then the latter source of variation may be 
defined as SStreat = A(T; — Gy, 

T, 


Where c= zh n 
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This definition of G differs from that used earlier in this section. If the 
latter definition of $$, is used, 55, + SSerror Will not be numerically 
equal to 55,1. 


3.13 Determination of Sample Size 


The level of significance of an over-all F test in the analysis of variance 
sets an upper bound on the type 1 error in the decision rule. By having a 
suitably large n under each treatment, the power of the test with respect to 
a specified alternative hypothesis may be made as large as desired. The 
specification of the alternative hypothesis is in terms of the parameter 


‚ _ (Ұш- ҮК 
ne E 


where о? is the experimental error per experimental unit. 

The charts given in Table B.11 estimate the power of the F test in the 
analysis of variance as a function of $’, k = number of treatments, n = 
number of experimental units under each treatment, and « — level of signif- 
icance of test. 

Before these charts may be used, some estimate of ¢’ must be obtained. 
This estimate need be only a rough one. Past experimentation, a pilot 
study, familiarity with the subject matter—all these sources of information 
will contribute to the estimation procedure. To illustrate one method for 
estimating ф', suppose that the experimenter considers a difference 


Шш-и-5 
to be a practically important difference. Suppose that k = 4 and that 
о? is estimated to be 100. Then 


(и, —uy 5 4 5° + 5° 4 5° 
k 4 E 
and ф = / 38, = .50. 


If the over-all test is to be made at the .01 level of significance, and if power 
.90 is desired with respect to ¢’ = .50, then, from the chart in Table B.11 
in which К = 4and « = .01, it will be found that = 20 experimental units 
per treatment are required, i.e., a total of 80 experimental units for the 
experiment. 

If, on the other hand, the over-all test is to be made at the .05 level of 
significance and power .90 is required with respect to $”, then n = 15 
experimental units per treatment will be needed. 


25, 


CHAPTER 4 


Single-factor Experiments Having 
Repeated Measures on the 
Same Elements 


4.1 Purpose 


In experimental work in the behavioral sciences the elements forming the 
statistical population are frequently people. Because of large differences 
in experience and background, the responses of people to the same experi- 
mental treatment may show relatively large variability. In many cases, 
much of this variability is due to differences between people existing prior 
tothe experiment. If this latter source of variability can be separated from 
treatment effects and experimental error, then the sensitivity of the experi- 
ment may be increased. If this source of variability cannot be estimated, 
it remains part of the uncontrolled sources of variability and is thus auto- 
matically part of the experimental error. ^ 

One of the primary purposes of experiments in which the same subject is 
Observed under each of the treatments is to provide a control on differences 
between subjects. In this type of experiment, treatment effects for subject 
i are measured relative to the average response made by subject i on all 
treatments. Іп this sense each subject serves as his own control—responses 
of individual subjects to the treatments are measured in terms of deviations 
about a point which measures the average responsiveness of that individual 
Subject. Hence variability due to differences in the average responsiveness 
of the subjects is eliminated from the experimental error (if an additive 
model is appropriate). 

Expériniénisih Bon the same elements are used under all the К treat- 
ments require k observations on each element. Hence the term repeated 
measurements to describe this kind of design. To the extent that unique 
characteristics of the individual elements remain constant under the different 
treatments, pairs of observations on the same elements will tend to be 
positively correlated. More generally, the observations will be dependent 
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rather than independent. If the population distributions involved are 
multivariate normal, the terms dependent and correlated are synonymous; 
analogously, the terms independent and uncorrelated are synonymous in 
this context. Since the models that will be used are assumed to have under- 
lying multivariate normal distributions, correlated measurements imply 
statistically dependent measurements. The designs in this chapter may be 
said to involve correlated, or dependent, observations. 

The notation to be used and general computational procedures to be 
followed are given in the next section. The rationale underlying the 
analysis and special uses of these designs is presented in later sections. 


4.22 Notation and Computational Procedures 


Notation for this type of design will be illustrated in terms of people as 
the elements of the statistical population. However, the notation is not 


Table 4.2-1 Notation 


Treatment 
Person 1 2 ыы j Ж k Total Mean 
1 Ха Xiz Ху Ху Р, Р, 
2 Xn X» Xy; Хау Py Р, 
t Xa Ха Xi Ха P; Р, 
л Xn Ха Xu Xu P, P, 
Total Т, s ee Т; ee ТЕ” G 
Mean 7, 7, e T, ee Т, б 


restricted to this case. In Table 4.2-1 the symbol X; represents the meas- 
urement on person 1 under treatment 1, Х the measurement on person 1 
under treatment 2, X;; the measurement of person 1 under treatment j. In 
general the first subscript to an X indicates the person observed and the 
second subscript the treatment under which the observation is made. 

The symbol P, represents the sum of the k observations on person 1, Рз 


the sum of the k observations оп person 2, P; the sum of the k observations 
on person i. In summation notation, 


P, = УХ; 
2 


that is, P, is the sum of the К entries in row i. Summation over the sub- 
script j is equivalent to summing over all columns within a single row. The 
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mean of the observations on person i is 


The symbol Т, represents the sum of the л observations under treatment 1, 
T, the sum of the n observations under treatment 2, T; the sum of the n 
observations under treatment j. In summation notation, 


Тао 
: 


Summation over the subscript / is equivalent to summing all entries in a 
single column. The mean of the n observations under treatment j, desig- 
nated 7, is 


The sum of the kn observations in the experiment, designated G, is 
СУВ =Т= ЕХ, 


The symbol ХУ Х,, represents the sum over all observations in the experi- 
ment. The grand mean of all observations, designated G, is 


In the analysis of this type of experiment, the total variation is divided 
into two parts: one part is a function of differences between the means of 
the people; the other part is a function of the pooled variation within 
individuals. The total variation is 


(1) SStoi = ZX(X;, — 6), 
the sum of the squared deviations of each observation about the grand mean. 


This source of variation has kn — 1 degrees of freedom. That part of the 
total variation due to differences between the means of the people is 


(2) SSpetween people = КУ(Р, лі Gy. 


In words, the between-people variation is a function of the squared seh 
tions of the means for the people about the grand mean. Alternative у, 
this source of variation may be viewed as due to the differences between a : 
possible pairs of P,; the larger such differences, the larger this source г 
variation. Since there are л means, this source of variation has n — 
degrees of freedom. 

The variation within person / is 


SS. personi — Хх ЕУ BS 
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the sum of the squared deviations of the observations on person i about the 
mean for personi. This source of variation has k — 1 degrees of freedom. 
The pooled within-person variation, designated 55, people» is 


(3) SSw. people = 55у. person i = EX(X;; э By. 


Since the variation within each person has k — | degrees of freedom, the 
pooled within-person variation will have n(k — 1) degrees of freedom. It 
is readily shown that the between- and within-people sources of variation 
are statistically independent and that 
SStotat = SSpetween people + SSy. people. 
The degrees of freedom corresponding to these sources of variation are also 
additive, 
kn — 1 = (n — 1) + n(k — 1). 
To show this partition of 55, algebraically, let 


bi; = Xi; P3 B, 

а= P, — б. 
Then, 2b; = 0 for all i, Ха, = 0, Ха = па. 
Непсе, УУаь, == Уа(Хь,) = Хо) = 0. 


From the definitions of Б, and а;, it follows that 
Х„—б= bi; + а. 
Hence, $$ ш XX(X;— б) = УУ(Ь, + а) 
17 Ж) 
= 22% + ХХ + 2УУаь,, 
= УУР, + пуа? + 200) 


= SSw. people == SSpetween people. 


The difference between two observations on the same person depends in 
part upon the difference in treatment effects and in part upon uncontrolled 
or residual sources of variation. Hence the pooled within-person variation 
may be divided into two parts: one part which depends upon differences 
between the treatment means, and a second part which consists of residual 


variation. That part which depends upon differences between treatment 
effects is defined as 


(4) SStreat = nX(T; = б). 
Alternatively, this source of variation may be expressed as 

Sf —m 

SS nXT, — T,y 


k(k — 1) 
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The expression 7; — Т, represents the difference between a pair of treatment 
means; the summation is with respect to all possible pairs of treatment 
means, order within the pair being disregarded. For example, if k = 3, 
п — TY + (% — Ty + (00 — Ts)" 
3(2) 
This source of variation has k — 1 degrees of freedom. 
The residual variation is 

(5) SSres = EX[(X;; — 6) — (Р; — 6) — (7; — GP. 

The terms that are subtracted from X,, — Саге, respectively, the person and 
treatment effects so that the residual variation represents those sources of 


SStreat 


Partition of the total variation Partition of the degrees of freedom 


Total variation 


Between-people Within-person 
variation 


variation 
Between-treatment 
variation 


Residual 
variation 


(п-1)(%-1) 


Figure 41 Schematic representation of the analysis. 


variation in the total that cannot be accounted for by differences between 
the people and differences between the treatments. The degrees of freedom 
for the residual variation are 
dfres = бот — dfbetween people — dfireat 
(kn — 1) (n — 1) (k — 1) 
=kn—n—k+1=n(k —1)—(k—1) 
= (Бе nd 
It is readily shown that SS,,,, and 55, аге statistically independent and 

that 


SSw. people = SStreat + SSres- 
The degrees of freedom for the corresponding variations are also additive, 
il 

( n(k — 1) = (k — 1) + (n — 1X — 1). 


The analysis of the sources of variation and the corresponding degrees of 


freedom is shown schematically in Fig. 4.1. i 
The definitions of the sources of variation do not provide the most con- 
venient formulas for their computation. Formulas for this purpose are 


summarized in Table 4.2-2. The symbols (1), (2), and (3) are identical to 
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those used in the case of single-factor experiments which do not have re- 
peated measures. Symbol (4) occurs only in experiments having repeated 
measures. In each case the divisor in a term is the number of observations 
that are summed to obtain an element in the numerator. For example, G 
is the sum of kn observations; T; is the sum of n observations; P, is the sum 
of k observations. A summary of the analysis of variance appropriate for 
this design is given in part ii of this table. Mean squares are obtained from 


Table 4.2-2 Summary of Computational Procedures 


G) (1) = бп (2) = xxx: (3) = (ET5)In (4) = (PK 


Source of ss 


variation a 
Between people | SSpetween people = (4) — (1) лі— 1 
(ii) Within people | SSw, people = (2) — (4) n(k — 1) 
Treatments | SStreat = (3) — (1) k—1 
Residual SSres = (2) - (3) — (4) + (1) (n — Ik = 1) 
Total СЕРТ = (2) - (1) | -1 


corresponding sums of squares by dividing the latter by their respective 
degrees of freedom. 


The F ratio 
d MStreat 
MSres 
provides a test of the hypothesis that т, = v, = * = T}, where the 7's 


represent treatment effects and are defined in the same manner as they were 
for the case of designs not having repeated measures. The rationale under- 
lying the use of this statistic for this test is discussed in Sec. 4.4. 

Under one set of assumptions (made explicit in Sec. 4.4) about the under- 
lying sources of variation, the F ratio has a sampling distribution which is 
approximated by the F distribution having k — 1 and (n — 1)(k — 1) 
degrees of freedom. This is the usual test. Under less restrictive assump- 
tions about the relations between the underlying sources of variation, Box 
(1954) has shown that the F ratio in the last paragraph has a sampling distri- 
bution (assuming that all т; = 0) which is approximated by the F distribu- 
tion having (k — 1)9 and (n — 1)(k — 1)0 degrees of freedom, where 0 is а 
quantity which depends upon certain homogeneity assumptions. The 
maximum value of 6 is 1.00, and the minimum value is 1/(k — 1). The 
maximum value of 6 is attained when the homogeneity assumptions under- 
lying the usual test are met. Use of the minimum value of 0 provides а con- 
servative test. Thus, if the ratio 

po MStreat 
MSres 
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is assumed to be distributed as an F distribution with 1 andn — 1 degrees of 
freedom (assuming that all т, = 0), one has a conservative test relative to the 
usual test. However, the assumptions underlying this test are much weaker 
than those underlying the usual test. (Conservative in this context implies 
that a larger value of the F ratio is required for statistical significance at a 
specified level of «.) 


4.3 Numerical Example 


The computational procedures described in the last section will be illus- 
trated by means of the numerical example in Table 4,3-1. The statistical 
basis for the analysis is discussed in the next section. The purpose of this 
experiment was to study the effects of four drugs upon reaction time to a 
Series of standardized tasks. All subjects had been given extensive training 
on these tasks prior to the experiment. The five subjects used in the experi- 
ment could be considered a random sample from a population of interest to 
the experimenter. 

Each subject was observed under each of the drugs; the order in which a 
subject was administered a given drug was randomized. (In designs con- 
sidered in later chapters, the order in which treatments are given to the same 
Subject is either controlled or counterbalanced.) A sufficient time was 
allowed between the administration of the drugs to avoid the effect of one 
drug upon the effects of subsequent drugs, i.e., an interaction effect. The 
numerical entries in Table 4.3-1 represent the score (mean reaction time) on 
the series of standardized tasks. Thus person | had scores of 30, 28, 16, 
and 34 under the respective drug conditions. The total of these scores is 
108; thus the numerical value of P, is 108. Тһе other values for the P's are 
obtained by summing the entries in the respective rows in part i. The 
numerical values for the 775 are obtained by summing the columns. For 
example, 7) is the sum of the five entries under drug І. The grand total, 
G, is obtained either by summing the P’s or by summing the T’s. A check 
оп the arithmetic work is provided by computing G by both methods. | 

Quantities required in the computation of the sums of squares are given in 
part ii. The first three of these quantities are identical to those computed for 
designs which do not involve repeated measures. Symbol (4) is obtained 
from the P’s. Each Pis the sum over k = 4 drugs; hence the divisor associ- 
ated with the symbol (4) is 4. The computation of the sums of squares 
required in the analysis of variance is illustrated in part iii. Ап alternative 
method for computing SS,,, is 

SSres = SSw. people — SSarugs 
= 811.00 — 698.20 = 112.80. 


The latter method is actually simpler than the method used in part iii; 
however, the method in part iii provides a partial check on the numerical 
Work, since the sum of 55, and $$, Should total SS, оне 
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The analysis of variance is summarized in Table 4.3-2. The F ratio 


p — MSwreat _ 232.73 _ 24.76 


MSres 9.40 


is used in testing hypotheses about reaction time as a function of the effects 
ofthe drugs. Fora .01-level test on the hypothesis that т, = т; = 73 = T4, 
the critical value for the F ratio is F,,(3,12) = 5.95. The experimental 


Table 4.3-1 Numerical Example 


Person | Drug 1 Drug 2 Drug 3 Drug 4 Total 
1 30 28 16 34 108 = P, 
2 14 18 10 22 64 — P, 
3 24 20 18 30 92 =P; 
4 38 34 20 44 136 — P, 
5 26 28 14 30 98 — P; 
132 128 78 160 498 =G 
T, T, T, T, 
. G* (498? 248,004 
(1) = Ge ШО = 12,400.20 
(2) = УУХ? = 13,892 
(ii ET? 1322 4 128° + 78? 2 
' @=20 = 2 19 BE = 13,008.40 
XP? 108° + 642 + 922 + 136° 2 32. 
а) -2 : + 1367 + 98? _ 52,324 — 13,081.00 
4 
pee OE жата ee ы... | 
SShetween people = (4) — (1) = 13,081.00 — 12,400.20 = 680.80 
654 people = (2) — (4) = 13,892 — 13,081.00 = 811.00 
(іі) SSarugs = (3) — (1) = 13,098.40 — 12,400.20 = 698.20 
55, -(2)-(3) - (4) +4) 
= 13,892 — 13,098.40 - 13,081.00 + 12,400.20 = 112.80 
SStotat = (2) - (1) = 13,892 — 12,400.20 = 1491.80 


een ee тык ee LÁ 


data contradict this hypothesis. Inspection of the totals for the drugs in 
Table 4.3-1 indicates that drug 3 is associated with the fastest reaction. 

Suppose that it had been anticipated before the experiment had been 
conducted that drug 3 would have a different effect from all others. The 
comparison that would be used in testing this hypothesis is 


C= 3T; — T; — Tz — T, = 3(78) — 132 — 128 — 160 186. 
The component of variation corresponding to this comparison is 
T Gi (—186)? 
nic 5[3° + (—1)? + (—1)? + (—1] 


576.60. 
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The F statistic 55 
жезазба ада 310,00) га 61.34 
MSres 9.40 

is used to test the hypothesis that z; = (тү + т + 74)/3. Тһе critical value 
for a .01-level test of this hypothesis Ғ.,(1,12) = 9.33. The observed data 
contradict this hypothesis. If this comparison were suggested by inspection 
of the data, the procedure given by Scheffé (described in Sec. 3.9) would be 
used to obtain the critical value. The latter critical value for a .01-1еуе1 
test is (k — 1)F p(k — 1, df) = ЗЕ,/3,12) = 3(5.95) = 17.85. Even 


Table 4.3-2 Analysis of Variance 


Source of variation 55 df MS | F 
Between people 680.80 4 
Within people 811.00 15 
Drugs 698.20 3 232.73 | 24.76** 
Residual 112.80 12 9.40 
Total 1491.80 19 


** F 00(3,12) = 5.95 


with this critical value, the data indicate that drug 3 is different in its effect on 
reaction time from the effects of the other three drugs. 

To test the hypothesis that тү = 7; = 74, the sum of squares for these 
three drugs is given by 
Ti Tic Ti (ТТ)? 


SSarugs, 2,4 


n 3n 
132% + 128? + 160° — (132 + 128 + 160) 
5 15 
= 121.60. 
The mean square corresponding to this sum of squares is 
= 60.80. 


121.60 
MSarugs 1, 2,4 = ao 


The statistic used in the test is 
MSarugs1,2,4 _ 60.80 6.47. 
MSres 9.40 


For a .01-level test, the critical value of this statistic is Е, (2,12) = 6.93. 
Although the observed F statistic does not exceed the critical value for a 
-01-level test, the observed Fis large enough to question the hypothesis that 
the drugs 1, 2, and 4 are equally effective with respect to reaction time. 
Inspection of the drug totals in Table 4.3-1 indicates that drug 4 has a 
Somewhat longer reaction time, but the evidence is not quite strong enough 
to establish this conclusion at the .01-level of significance. 


Jal 
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The data, in this case, can be adequately summarized in terms of a few 
selected comparisons. Analogous conclusions can be reached by other, 
somewhat more systematic probing procedures. Any of the methods 
discussed in Secs. 3.8 and 3.9 may be used to test the difference between all 
possible pairs of means. In such tests MS,e has the role of MS. 
Application of the Newman-Keuls method is illustrated in Table 4.3-3. 


Table 4.3-3 Tests on Differences between Pairs of Means 


Drugs 3 2 1 4 


Totals | 78 128 132 160 


78 — 50 54 82 


3 
А 2 128 не 4 32 
o 1 132 ES LE 
4 160 aie 
(ii) 4ә(12)| 432 | 5.04 | 5.50 
(iii) УМ, 9 4012) | 29.64 | 34.57 | 37.73 
3 2 1 4 
a жж жж жж 
в Ie uai 
4 E 


With repeated measures, barring missing data, the numbers of observations 
under each treatment will be equal. In this case treatment totals may be 
used rather than treatment means. The drug totals, in increasing order of 
magnitude, are given in parti. The entry in а cell of part i is the difference 
between a total at the head of a column and a total to the left of a row. 
Critical values for the statistic 

25-5 


= М5” 


where r is the number of steps two totals are apart оп an ordered scale, are 
given in part ii. These values are obtained from the first three columns of 
tables for the 99th percentile point for the q statistic; the degrees of freedom 
for this q statistic are the degrees of freedom of MS, Critical values for 


T; — Ty = 4, VnMSres 


are given in part iii. In this case 


V/nMSres = 4/5(9.40) = 4/47.00 = 6.86. 
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Thus the entries in part iii are 6.86 times the corresponding entries in part ii. 

The order in which tests are made is given in Sec. 3.8. The critical value 
for the difference Т, — Т; = 82 is 37.73. Hence the data contradict the 
hypothesis that 74 = тз. Тһе difference T, — T, = 54 has the critical value 
34.57, and the difference T, — Т, = 50 has the critical value 29.64. Тһе 
difference Т, — Т, = 32 has the critical value 34.57; this difference does not 
quite exceed the critical value. No further testsaremade. The tests which 
yield statistically significant results are summarized in part iv. Drug 3 
appears to be different from the other drugs in its effect on reaction time. 
Although the differences between drug 4 and drugs 2 and 1 are relatively 
large, the differences do not exceed critical values of a .01-level test. This 
latter result is consistent with the outcome of the test of the hypothesis that 


Table 4.3-4 Variance-Covariance Matrix 
Drug 1 Drug2 | Drug3 | Drug4 


Drug 1 76.80 53.20 29.20 69.00 


Drug 2 42.80 15.80 47.00 
Drug 3 14.80 27.00 
Drug 4 64.00 


7, = Ty = т, This hypothesis was not rejected at the .01 level of signif- 
icance, but the observed F statistic was close to the critical value. 

The computational formula for MS,,, is algebraically equivalent to the 
expression 

MSres = ty == TOV x, 
where §% is the mean of the variances within each of the drug conditions and 
COV, is the mean of the covariances between the pairs of observations under 
any two drug conditions. To show this equivalence for the numerical data 
in Table 4.3-1, the variance-covariance matrix for these data is given in 
Table 4.3-4. 

The variance of the observations made under each of the drugs appears 
along the main diagonal of this table. For example, the variance of the 
observations made under drug | is 
a  UX?—(Tij/n) 3192 =з» Di 


Si 


n—1 
The covariances appear above the main diagonal. For example, the co- 
variance between the observations under drug 1 and those made under drug 
2 is 
X(X4X3) = (Тп) 
nui 

(30)(28) + + ++ + (26)(28) — [(132)(128)/5] 53.20. 

z 4 


Соух, — 
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The mean of the variances is 49.60; the mean of the covariances is 40.20. 
Thus, 


5 — TOV y = 49.60 — 40.20 = 9.40. 


The numerical value of MS,e obtained by the computational formula is 
also 9.40. It is considerably more work to obtain MS,,, from the variance- 
covariance matrix than it is to obtain MS,es by means of the computational 
formula. However, in order to check certain of the assumptions underlying 
the F test, computation of the variance-covariance matrix is sometimes 
required and is often enlightening in its own right. 

MS between people 15 also related to 52 and соу у. This relationship is 


MSpetween people = sk + (k — 1) соу у. 
From Table 4.3-2, 
SSpetween people = 680.80 
k—1 


— 170.20. 


MShetween people 


Table 4.3-5 Intercorrelation Matrix 


Drug 1 Drug 2 Drug 3 Drug 4 


Drug 1 1.000 917 .860 978 
Drug 2 1.000 .620 .887 
Drug 3 1.000 877 
Drug4 1.000 


In terms of the average variance and the average covariance, 
MSpetween people = 49.60 + 3(40.20) = 170.20. 


The matrix of intercorrelations for the data in Table 4.3-1 is given in 
Table 4.3-5. The average intercorrelation of the off-diagonal elements is 
? = 86. Had a common estimate of the population variance been used in 


computing these intercorrelations, this average would have been /” = .81. 
In terms of this latter average, 


MSres = 5x(1 — P) = 49.60(1 — .81) = 9.42. 


Had the data been uncorrelated, MS,, would have been equal to 49.60. 
The larger the average intercorrelation, the smaller will be MS, 


гез" 


4.4 Statistical Basis for the Analysis 


The validity of tests in the last section rests upon a set of assumptions 
about the nature of the underlying sources of variation. The case in which 
the number of treatments is equal to 2 will be considered first. The assump- 
tions that will be made for this case are actually stronger than those required; 
however, all the assumptions will be required for the case in which the 
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number of treatments is greater than 2. Suppose that each of the observa- 
tions may be expressed in terms of the strictly additive model given below: 


Person Treatment 1 | Treatment 2 
1 Ay = OF mae т 61 Xe = и +m + т + ёз 
n Xm = н + т + тү + m | Хә = Н + т; + т; + ет 
Ту = mp + Em; + nr X | T, = пи + От, + nr + Хет 
T= и+ т tnta | hasut t+ nt d 


EUCH; SO TIN HRS | ee 
The notation is defined below: 


X;; = an observation on person i under treatment j. 


Щщ = mean of all potential observations under treatment 1, that is, mean for 
treatment 1 if the entire population of people were observed under 


treatment 1. 

lz = mean of all potential observations under treatment 2. 

7, = a constant associated with person і. In the population of people the 
mean of the т; is assumed to be zero. 

u = (д, + иь)/2 = grand mean of all potential observations. 

Tı = шщ — и = main effect of treatment 1. 

To = рь — и = main effect of treatment 2. 

ғу = experimental error associated with X;;. 

= all sources of variation in X; except those accounted for by the 7's and 


the z's. 


Treatments 1 and 2 will be assumed to constitute the population of treat- 
ments. From the definition of 7, and 7a, it follows that, + т, = 0. Since 
the n people in the experiment are assumed to be a random sample from a 
potentially infinite population of people, т; is a random variable. In the 
population of people, 7; will be assumed to be normally distributed with 
mean zero and variance o>. 

Within the population of potential observations under treatment 1, the 
experimental error ғ, is assumed to be normally distributed, with mean 
equal to zero and variance equal to 02. Within the corresponding popula- 
tion under treatment 2, the experimental error is also assumed to be normally 
distributed, with mean equal to zero and variance equal to б, Тһе (мо 
distributions of experimental error will be assumed to be independent; 
further it will be assumed that 


Ce 8:22 224 
б — б, = б, 


i.e., the error variances are homogeneous. 
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Since и and т, are constant for all observations under treatment 1, the 
variance of the X’s within treatment population | isa function of the variance 
due to т, and єл. Assuming т, and ғ, uncorrelated, 


2 2 
ox, =o; + бу. 


In words, under the assumptions that have been made, the variance of the 
potential observations under treatment | is the sum of the variance due to 
experimental error and the variance due to differences between the z's. 
Similarly the variance due to the potential observations under treatment 2 is 


2 
2+ of. 


Ox, 73.0, 

Since the term 7; is common to two measurements on the same person, 
the covariance between X, and X, will not in general be equal to zero. 
All the covariance between X, and X, is assumed to be due to the term т; 
If this covariance is denoted by the symbol Суух, then 


eEVE | 
ON NX, = üz: 


Under the assumptions that have been made, 


2 2 
9x; + Ox, 


2 = 0? 4- оў. 
2 2 
ox, + ох 2 
Hence oy x, = 0h 


2 

The computational formula for MS,.., given in Sec. 4.2, is equivalent to 

the left-hand side of the above expression if statistics are substituted for 

corresponding parameters. MS,,, provides an unbiased estimate of о; in 
symbols, this last statement is expressed by 


E(MSres) = o?. 


The expectation in this case is with respect to random samples of size л. 
Under the assumptions made, the sampling distribution of (n — 1)MS,,./o? 
may be approximated by a chi-square distribution having n — | degrees of 
freedom, 
From the structural model it is seen that 
T, — Т» = (тү — 72) + (& — &). 

Since the same people are observed under both the treatments, this difference 
is free of any effects associated with the z's. The variance of the quantity 
T, — Т,, when the experiment is replicated with random samples of size л 


people, has the form ^ 


— 
On a0, = i-a + SU 
This expression assumes that т, — т, and & — & are uncorrelated. Ап 


equivalent expression foro? , is 
1- 53 


Op = о + % 
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This last expression is implied by the previous expression, since the variance 
of a variable is a function of the differences between all possible pairs of the 
variables. Multiplying both sides of this last expression by n gives 


nop = по? + no? 


For random samples of size n, it has been shown in earlier chapters that 
по? = o2. It may further be shown that Мб is an unbiased estimate of 
no^. Hence 

т E(MStreat) = nop = по? + o2. 

Under the assumptions that have been made, when o? — 0 the sampling 
distribution of MS,,.,./o? may be approximated by a chi-square distribution 
having one degree of freedom. Further the sampling distribution of MS rent 
is independent of the sampling distribution of MS,,. Thus the statistic 


MStreat 
MSres " 


under the hypothesis that о? = 0, respresents the ratio of mean squares 
having independent chi-square distributions. Further, when o? = 0, the 
numerator and denominator have expected values equal to o}. Hence the 
F statistic has a sampling distribution which may be approximated by an F 
distribution having one degree of freedom for the numerator and л — 1 
degrees of freedom for the denominator. More explicitly, 


F= 


when o? = 0. 

The model under which this last F ratio was obtained did not include a 
possible person by treatment interaction. The more inclusive model is 
Х =U +m ту + тту + Ei 
The person by treatment interaction, тт,,, is a measure of the reliable effect 
associated with person i which is exhibited in the presence of treatment j and 
which cannot be predicted from knowledge of т; and т). The interaction 
effect лт, is considered to be a random variable, normally distributed, with 
mean equal to zero within each of the treatment populations, and with 


variance ENT 2 


zn = Snr, — Orr 


If the тт, /5 are defined to be uncorrelated (a less restrictive assumption is 
made later in this section) with the other terms on the right-hand side of the 


model, it may be shown that 

E(MStreat) = о; + Ger + по», 

E(MSres) = 0 + Orr 
The sampling distribution of the ratio of these two mean squares is still 
approximately an F distribution with one and n — 1 degrees of freedom. 
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In this design c? and o7, cannot be estimated separately. Only their sum 
may be estimated. Since they cannot be estimated separately, there is no 
real need to have both effects in the model; if there is a person by treatment 
effect, it is completely confounded with the residual or experimental error. 
In designs which are discussed in later chapters, it is possible to obtain a 
direct estimate of ož. 

There are alternative approaches which justify the use of the final F ratio 
as а test of the hypothesis that o? = 0. However, the approach that has 
been followed leads more directly to the general case in which the number of 
treatments is any number k. For this general case, an observation Х,, on 
person / under treatment / may be expressed in terms of the strictly additive 
model as 

Xi — uM m M 7; + г. 

The terms on the right-hand side of this model have definitions analogous to 
those given for the case in which k = 2. The variables т, and ғ,, are 
assumed to be normally distributed, with expected value equal to zero within 
each of the treatment populations. It is also assumed that т, and e,, are 
uncorrelated. Since the same people are observed under each of the treat- 
ment populations, т; is assumed to be the same for all treatments. It will 
also be assumed that the variance due to experimental error, c?, is the same 
for all treatment populations. 

The subjects in a single experiment are assumed to be a random sample of 
size n from a population of N people, where М is quite large relative to п. 
The population variance for treatment j is defined as (the summation is over 
the population of subjects) 


Хх; T u 
к= 
where ш; = XXuN is the population mean for treatment j. The popula- 


X; 


E 


D 
tion covariance between Х,, and X,;., where j and j’ represent two different 
treatments, is defined as 


XX, = uyKXiy — uy) 


N=‘ 

= Рх,хубх,бх,» 
The symbol Px,x,, denotes the correlation between observations in treat- 
ment population j and treatment population j’. (In designs which do not 
involve repeated measures on the same people this correlation will be zero.) 
If the underlying model is correct, the covariance between any two treat- 
ments is due solely to the presence of т; in pairs of measures on the same 


people. From this it follows that all the population covariances will be 
equal; i.e., 


9 x,xy. 


Oy;x; = c? for all treatment populations. 
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Further, since т, and e; have been assumed to be uncorrelated, with o? being 
constant for all treatment populations, it follows that the variance of the 
potential observations within each of the treatment populations may be 


expressed as x à 
oy = 07 + o. 
3 —— Г «3% 
Hence È = су — OF = 0x — ON, xy 


In words, the variance attributable to experimental error is equal to the 
within-treatment variance minus the covariance between treatments. 

Each of the terms in this last expression is a parameter. Unbiased esti- 
mates of these parameters are available from experimental data, provided 
that experimental data are obtained under conditions which approximate 
those assumed by the structural model. The mean of the within-treatment 
variances for the experimental data provides an unbiased estimate of o%; 


that is, Жел. " 

E(3) = Eris) = o. 
Similarly the mean of all the covariances will provide an unbiased estimate 
ооуу. Тһе mean of these covariances will be designated by the symbol 


cov. It may further be shown that 

E(5% — cov) = 02. 
The computational formula for MS,,, given in Sec. 4.2 may be shown to be 
algebraically equivalent to the expression in parentheses. Thus 

Мб = 5% — COV. 

Hence E(MSres) = 02. 
The expectation in this case is with respect to replications of the experiment 
with random samples of л subjects. Under the conditions that have been 
specified, the sampling distribution of (k — Da — 1)MS,,,/o2 may be 
approximated by a chi-square distribution having (k — (и — 1) degrees 


of freedom. қ 
The difference between the mean of the n observations under treatment 


j, T;, and the mean of the n observations under treatment j’, Ty, may be 
expressed in terms of the linear model as 

T, — Ty s (ау тй rg) 
Since the same people appear under both the treatments, a term involving 
the difference between two z's does not appear in the above expression. 
Since it has been assumed that the treatment effects and the experimental 
error are uncorrelated, the variance of the difference on the left-hand side of 
the above expression will provide an estimate of the sum of the variances of 
the differences on the right-hand side. In symbols, 
nXT, 13), Ty = ор + поў. 


MStreat = k(k 521) 
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The computational formula for MStreat given in Sec. 4.2 is algebraically 
equivalent to that appearing on the left-hand side of the above expression, 

Under the hypothesis that o? = 0, the sampling distribution of (k — 1) 
MS ireai/0z may be approximated by a chi-square distribution having k — 1 
degrees of freedom. Further, under this hypothesis both М5, „„; and MS 
provide independent estimates of o2. Hence the F ratio 


res 


7 Еа MStreat 
MSres 


has a sampling distribution approximated by an Fdistribution having k — 1 
and (k — (и — 1) degrees of freedom. If оў were greater than Zero, 
the numerator of the F statistic would have an expected value which differs 
from the expected value of the denominator by an amount equal to ло? 
Hence numerically large values of the F ratio will tend to contradict the 
hypothesis that o? = 0. In terms of expected values, 


E(MSueu) — o2 + no? 
E(MSres) өз 


с 


E(F) = 


=1.00 when o? = 0). 


In the structural model from which the expected values were obtained, no 
person x treatment interaction appears. If this interaction is defined as 


TT; = E(X;;) neu cm. 
then the structural model has the form 


In this model £,; is assumed to be normally distributed with variance о? 
within each of the treatment populations and Statistically independent of the 
other terms on the right-hand Side; т; is assumed to be a constant associated 
with treatment j; т, and 77,; are assumed to be random variables having a 


In order that the usual F test on the main effects due to the treatments be 
valid, the following restrictions are imposed upon the model: 


©х,х, = pox for all 7% and j^s, 
Under these conditions it can be shown that 
E(MSpetween people) = 02 + Ко? = ЕП + (К — 1)р1, 
E(MStreat) = 02 + 02, + по? = oX(1— р) + по?, 
E(MSres) = о? + o2, = ао): 


т 


Непсе MSbetween people = te + (k — 1) cov, 


MSres = Se — cov. 


A formal proof of the equivalence of these expected values is given by 
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Scheffé (1960, pp. 262-267). The F ratio in the test of the hypothesis 
c? = Ostill has the form 


MS 
F = treat 3 
MSres 
The assumptions of homogeneity of within-treatment population variance 
oy, = оу,= *** = сү, and homogeneity of covariance 
9,22 2 0414, D EE Lin 


which underlie the validity of the usual F test for this design, are highly 
restrictive. That is, the data must conform to a prescribed pattern of 
variances and covariances before the statistical test can be considered exact. 
Box (1953) has indicated that the usual F test in case of uncorrelated data is 
relatively robust (insensitive) with respect to violation of the assumption of 
homogeneity of variance. That is, for uncorrelated data, violation of the 
assumption of homogeneity of variance does not seriously bias the final F 
test. 

Box has shown, however, that heterogeneity of both the variances 
and covariances in a design having correlated observations will generally 
result in a positive bias in the usual F test. That is, the critical value as 
obtained from an F table tends to be too low relative to a critical value 
appropriate for an arbitrary variance-covariance matrix. (The usual F test 
is appropriate only for a very special variance-covariance matrix.) For 
the case in which the variance-covariance matrix is arbitrary in form, an 
approximate test may be made through use of the usual F statistic, but the 
degrees of freedom are taken to be (k — 1)0 and (k — Іп — 1), where 0 
is a number which depends upon the degree of heterogeneity of the variances 
and covariances. When the assumptions of homogeneity are met, 0 equals 
unity. The greater the heterogeneity, the smaller 0 is. The smallest 
possible value that 0 can assume is 1/(k — 1). ү 

If one assumes that 0 is actually equal to this lower bound, the resulting 
test will tend to err on the conservative side. That is, the F value deter- 
mined from an F table will tend to be somewhat larger than the exact value. 
(This kind of test is said to be negatively biased.) The degrees of freedom 
for the numerator of the F ratio, assuming that 0 = 1/(k — 1), are 


k—1 
ӯ 5 
«== 
The degrees of freedom for the denominator, assuming that 0 = 1/(k — 1), 
are 
- 1)(К — 1 
(n — 1k — 00 = (п 5 X кү 


Hence the conservative test assumes that the F ratio has one and m 1 
degrees of freedom. When the assumptions of homogeneity of variances 
and homogeneity of covariances are questionable, the conservative test indi- 


cated in this paragraph provides an approximate test. 
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For the case in which k > 2 and n > k, Hotelling's 7? statistic may be 
used to test the hypothesis that о? = 0. This test is exact if the underlying 
distributions are multivariate normal. Use of Hotelling’s T? statistic 
requires no assumptions of homogeneity of covariances on the population 
variance-covariance matrix. Computationally, Hotelling’s 7? statistic is 
more complex, sinceit involves the inverse of the covariance matrix. When 
the homogeneity assumptions are met, the T? statistic is algebraically 
equivalent to the usual F statistic. For a more detailed discussion of this 
Statistic see Anderson (1958, pp. 101-105) and Geisser (1959). Anumerical 
example of the 7° statistic appears іп Appendix A.6. 

Procedures for testing homogeneity hypotheses about population co- 
variance matrices are described in Sec. 7.7. In particular 73 as defined in 
Sec. 7.7 may be used to test the hypothesis that the population covariance 
matrix has the form 


This is the form specified by the model used in this section to justify the 
subsequent F tests. In the definition of 7? in Sec. 7.7,q corresponds to the 
number of treatments; in the notation of this section, k = q, and N = nk. 


4.5 Use of Analysis of Variance to Estimate Reliability 
of Measurements 


Given a person possessing a magnitude т of a specified characteristic. 
In appraising this characteristic with some measuring device, the observed 
score may have the magnitude т + 7. The quantity 7 is the error of meas- 
urement; all measurement has some of this kind of error. The latter is due 
in part to the measuring device itself and in part to the conditions surround- 
ing the measurement. In the development that follows, it will be assumed 
that the magnitude of the error of measurement is uncorrelated with z. A 
measurement on person with measuring instrument jmay be represented as 


(1) Xi; = 0; + Nis 
where X,; = observed measurement, 


7, = true magnitude of characteristic being measured, 
ij = error of measurement. 


Upon repeated measurement with the same or comparable instruments, 


m; 18 assumed to remain constant, whereas n, is assumed to vary. Themean 
of k such repeated measures may be represented as 


0) рет, 
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A schematic representation of a random sample of k measurements on the 
same or comparable measuring instruments is shown in Table 4.5-1. If 
7, remains constant for such measurement, the variance within person i is 
due to error of measurement and the pooled within-person variance also 
estimates variance due to error of measurement On the other hand, the 
variance in the P'sis in part due to differences between the true magnitudes of 
the characteristic possessed by the n people and in part due to differences in 
the average error of measurement foreach person. The analysis of variance 


Table 4.5-1 Estimation of Reliability 


Comparable measurements | 
Person |= ———————— Total | Mean 
1 2 j k 
1 Ха Xi Ху Ху Р, P, 
21 22 Xo; 2k P, P, 
i Ха Хә Ху Хь Р; Р, 
n Xa Хаз Xni Xu Р, | P, 
Total Т} Th ee Шз, Caen m G 


and the expected values for the mean squares for data of the type shown in 
Table 4.5-1 are given in Table 4.5-2. MS, i people 1$ defined as 
КХР, — С)? 
MSpetween people = БЕТТЕ , 


whereas the variance of the P’s is given by 


2 2A 0) 
ар nou 
Thus MSpetween people = ksp. 


In terms of (2), the expected value of the variance of the P’s is 
Е(5%) = 0? + о. 


The quantity o? is the variance of the true measures in the population of 
which the n people in the study represent a random sample. From 


2 
the relationship between MSpetwcen people and у}, 

2 [d Ng 2 

E(MS»etween people) = Кой + ko, = CP Кор, 


since Коў = ой. 
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The reliability of P,, which is the mean of k measurements, is defined to be 


о? аё 


: hdd of (E) 

In words, the reliability of the mean of k measurements is the variance due 
to the true scores divided by the sum of the variance due to the true scores 
and variance due to error of measurement. From Table 4.5-2, it is noted 
that MS, „оре provides an estimate of o} and MSpetween people provides ап 
estimate of o; -+ Кот. Hence 


(MSpbetween people — MSw. people) estimates kož, 


1 2 4 
апа q MSbetween people — Му. peopie) estimates 02. 


Table 4.5-2 Analysis of Variance 


Source of variation | MS EMS) 


Between people | MSyetween peole | 0 + ko? 


Within people | М5, моме | 02 


Thus, the estimated reliability for the mean of k measurements, upon sub- 
stituting in (3), is 
(4) Ty (1/k)(MSpetween people — MSw. people) 

(L/K)(MSpetween people — MS. people) ір (1/k)MSw. people 


мз: MSpetween people — MS,. people 


MSpetween people 
= MSw. people 4 
MSdetween people 
The reliability of Х,,, which is a single measurement, is defined as 


(9 р = 
The estimate of this reliability is 
(6) n= C/k)(MSbetween people — MSw. people) 

(1/k)(MShetween people — MS,. people) + MS,. people 

a MShetween people — М5». People 

MSbetween people F (k деу 1)MSy. people : 

In terms of the relations discussed in the last section, 
cov cov 
cov + (б — cov) 3% 

In words, the reliability of a single measurement is estimated by the average 
of intercorrelations between the measurements when a pooled estimate 


т = 
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of the variance is used in the denominator of eachof the correlations. This 
latter statistic will be quite close to the mean of the actual intercorrelations 
when the within-column variances are approximately equal. 

The reliability of the average of k measurements may be expressed in 
terms of the reliability of a single measurement. By straightforward alge- 
braic rearrangement of the terms in (4), it may be shown that 


ж kr, 
IFES Dr 


Conversely, by substitution of the expression for r, in (7), the latter will 
assume the form given in (4). Given the value of гу, (7) may be used to 


(7) T 


Table 4.5-3 Numerical Example 


Person Judge 1 | Judge 2 | Judge 3 Judge 4 | Total 
1 gen s| 4 | 3 32 | 12-P 
2 5 | 7 | 5 6 23=Р, 
3 1 | 3 1 2 1=P, 
0 4 7 9 9 8 33 =P, 
5 2 | 4 6 1 13 =P; 
6 6 | 8 | 8 4 26 = P, 
Total | 234 35 32 24 114=G 
| T, T, | Тз T, 
2 IT? IP? 
(ii) (1) = 2 = 541.50 (2) = X(ZX?) = 700 (3) = = = 559.00 (4) = = = 664.00 
n 


nn EE 


SSpetween people = (4) — (1) = 122.50 

St Б {лш т. 
iii SS, = (3) = ее ЛІ 
(iii) Pari OE Wn) 2.1850 

SStotal = (2) -(@) = 158.50 


a ___— 


estimate the reliability of the mean of any number of comparable measures. 
In the area of psychometric (7) is known as the Spearman-Brown prediction 
formula. The assumptions underlying the validity of this formula are those 
underlying the analysis of variance given in Table 4.52. Theseassumptions 
are thatthe error of measurement is uncorrelated with the true score, thatthe 
sample of л people on whom the observations are made is a random ien 
from the population to which inferences are to be made, that the vids e s 
k measuring instruments used is a random sample from a population - 

comparable instruments, and ath the within-person variance may be 

ide an estimate of c7. . 2! 

BE date in Га 4.5-3 will be used to illustrate the numerical application 
of reliability estimation. In this example, four judges were asked to ме 
each member of a group of six people оп а specified characteristic. е 
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data in the columns of part i represent the ratings made by each of the 
judges. The data in the rows are the four ratings for a single person. For 
example, the ratings on person 1 are respectively 2, 4, 3, and 3. The totals 
of the ratings on each person are given under the column headed Total. 
In the numerical work, the judges have the role of the treatments. 

Inpartiithecomputational symbols used to obtain the sums of Squares are 
Shown. These symbols have the same definitions as those given in Sec. 4.2. 
The sums of squares are computed in part iii. Only between- and within- 
people sums of squares are required to estimate the reliability as defined by 
(4. The additional sums of squares that are computed will be discussed 
in connection with an alternative definition of reliability. The analysis of 
variance is summarized in Table 4.5-4, 


Table 4.5-4 Analysis of Variance 
ке_—Б ——.——_—__—.... 


Source of variation | ss df MS 
Between people | 122.50 5| 24.50 
Within people | 36.00 18 2.00 
Between judges 17.50 3 5.83 
Residual | 18.50 15 1.23 
Total | 155504 2777221 9” 


The estimate of reliability of the average of the four ratings made on each 
of the people, based upon (4), is 


MS. people 1 2.00 
MShetween people 24.50 
One interpretation of this reliability is as follows: If the experiment were to 
be repeated with another random sample of four judges, but with the same 
people, the correlation between the mean Tatings obtained from the two sets 
of data on the same people would be approximately .92. This interpreta- 
tion assumes that the variance due to differences between the mean ratings 
made by the judges is part of the error of measurement and does not repre- 
sent a systematic source of variation, 

On the other hand if the mean of the ratings made by individual judges 
represents a systematic frame of reference for the individual judges, then the 
source of variation due to differences between these means should not be 
considered part of the error of measurement. Adjustments for differences 
in frame of reference may be computed as follows: 


м = 1 


2 


Judge 1 | Judge 2 Judge 3 | Judge 4 


Total 23 35 32 24 114 


Mean | 3.83 | 5.83 5.33 4.00 | 4.75 =G 
Deviation from С | —92 | 108 59 —.75 
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The deviation of the mean rating of a judge from the mean rating of all 
judges defines what may be called an adjustment for the frame of reference. 
If, for example, the quantity 1.08 is subtracted from all the ratings made by 
judge 2, the mean of the resulting adjusted ratings will be equal to the grand 
mean. Similarly, if the quantity —.92 is subtracted from all ratings made 
by judge 1, the mean of his adjusted ratings will be equal to the grand mean. 
For data adjusted in this way the within-people variation is free of any source 
of variation which is a function of differences in frames of reference for the 


judges. 


Table 4.5-5 Numerical Example (Adjusted Data) 


Person Judge 1 Judge 2 Judge 3 Judge 4 Total 
1 l 2.92 | 2.92 2.41 3.75 12 
2 5.92 5.92 4.41 6.75 23 
(i) 3 1.92 1.92 41 2.75 7 
4 7.92 7.92 8.41 8.75 33 
5 2.92 2.92 5.41 1.75 13 
6 6.92 6.92 7.41 4.75 26 
Total | 28.52 28.52 28.46 28.50 114 
— rs prp eee Se 
IT? 
(ii) (1) = 2 = 541.50 (2) = ХХХ? = 682.50 (3) = = = 541.50 
п 
EP? 
(4) = = = 664.00 
k 
SShetween people = (4) - (1) = 122.50 
2 people =) - (4) = 18.50 
(iii) SSpbetween judges = (3) - (1) 13 -00 
SSres = (2) — (3) — (4) + (1) = 1850 
SStotal -(2)-() - 141.00 


The data in Table 4.5-3, adjusted for differences in frames of reference, 
are given in Table 4.5-5. The entries under Judge 1 are obtained by sub- 
tracting the quantity —.92 from the corresponding entry under Judge 1 in 
Table 4.5-3, The entries under Judge 3 are obtained by subtracting 59 
from the corresponding observed data. The entries in part i of Table 4.5-5 
represent the ratings adjusted for frames of reference. The sum of the 
adjusted ratings for each of the people rated is the same as the corresponding 
sum for the observed ratings. The column totals for the adjusted ratings 
are, except for rounding error, all equal. Thus for the adjusted ratings 
there are no differences between the mean ratings given by each judge. The 
computational symbols associated with the sums of squares are given in 
part ii, and the sums of squares are given in part iii. The SSpetween people 18 
not affected by the adjustment procedure. However, 56, peope for the 
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adjusted data differs from the corresponding sums of squares for the unad- 
justed data by an amount equal to 58.4... Thus the effect of the adjust- 
ment for frames of reference is to eliminate the between-judge variation 
from the within-person variation. 

The analysis of the adjusted data is given in Table 4.5-6. Since the varia- 
tion due to the judges is eliminated from the within-person variation in the 
course of the adjustment process, the degrees of freedom for this source of 
variation are also removed from the degrees of freedom for the within- 
person variation. Hence the degrees of freedom for the adjusted within- 
person variation are 18 - 3 = 15. Itshould be noted that the М5,, Jo 


Table 4.5-6 Analysis of Variance (Adjusted Data) 


Source of variation SS df MS 
Between people 122.50 5 24.50 
Within people 18.50 15' 1.23 

Total 141.00 20’ 


for the adjusted data is numerically equal to the MS,,, for the unadjusted 
data. In terms of the adjusted analysis of variance, the reliability of the 
mean of four ratings is, by using (4), 


MShetween people — MSy, people (adj) 


Ta 
MS»etween people 


_ 24,50 — 1.23 _ 
p M 
This estimate of reliability could have been obtained from Table 4.5-4 by 
using MS,, in place of MS, еее in (4). The estimate of the reliability 
of a single rating for the adjusted data, by using (6), is 
. 24.50 — 1.23 

24.50--(4 — 1123) ” 
The reliability of a single rating for the adjusted data is approximately equal 
to the average intercorrelation between ratings given by pairs of judges. 


The intercorrelation between pairs of judges is not changed in the adjust- 
ment process. The actual intercorrelations are: 


ee sais d ша ca hd 4. обы! гь ЬЩ 
Judge 1 | Judge 2 Judge 3 | Judge 4 


1 


Judge 1 1.00 .87 .87 
Judge 2 87 .87 
Judge 3 59 


DL SENE НЕШ | +! 
The average of these correlations is .84. 
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By means of (7) the reliability of the mean of four ratings may be obtained 
from the reliability of a single rating. Thus, 
— 4(.83) = 

fpe UR) C 

Whether the equivalent ofthe variation between judgesis to be considered 
part of the error of measurement or whether this source of variation is to be 
considered systematic variation that is not part of theerror of measurement 
depends upon the use that is to be made of the resulting data. If the zero 
points on the scales of measurement are of no importance, then differences 
due to the equivalent of frames of reference should not be considered part of 
the error of measurement. In this case the estimates of the reliability are 
given by 


T4 


(4) r, MShetween people 7” MSres 
k , 
MSnpetween people 


(6) n MSnetween people — MSres 3 
MSpetween people ДЕ (k = 1)MSres 
Table 4.5-7 represents data that would be obtained from the administra- 
tion of atest ofkitems. The totals P}, Ps, . . . , P, represent the test scores 


Table 4.5-7 Representation of Test Scores 


Person | Item1 | Item2 | --: | Item Test score 
1 Ха Ха 
2 21 22 
n Ха Ха 
rue 


for the people taking the tests, provided that the score on the test is obtained 
by summing the scores on the individual items. An estimate of the d 
bility ofthe test may be obtained from (4’). The reliability of a total is the 
same as that of a mean score. The computational procedures used in 
Sec. 4.2, with the test items having the role of the treatments, will provide 
the quantities required in (4). If the test items are scored 1 for correct 
and 0 for incorrect, the only information required to obtain an estimate 
of the reliability of the test is the scores on the tests and the number of 
correct responses to each item. (In cases where the observed data consist 
of 0’s and 1’s, EX = EX?) The error of measurement in this case may be 
interpreted as a measure of the extent to which people having the same test 
scores do not have identical profiles of correct responses. For person 1, 


the profile of responses is gis Aaa; pae 
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One additional point about the model underlying reliability should be 
made explicit and emphasized. Just as in the basic model for correlated 
observations discussed in Sec. 4.4, one must assume that thez,,’s are constant 
underall measurements made. This assumption implies that the correlation 
between the judges (tests, items) be constant. In particular the analysis-of- 
variance model cannot be used to estimate reliability when the true score 
changes irregularly from one measurement to the next, as, for example, 
when practice effects are present in some nonsystematic manner. If, 
however, changes in the underlying true score are systematic and constant 
for all subjects, then adjustments for this change may be made by eliminat- 
ing variation due to change from the within-subject variation. 


4.6 Tests for Trend 


The methods of curve fitting discussed in Sec. 3.7 may be adapted for use 
in connection with experiments involving repeated measures. These 
methods will be illustrated by means of a learning experiment. 


Table 4.6-1 Analysis of Learning Data 
_+е_-—-————————_—— 


Source of variation SS df MS р 
Between subjects 90.00 9 10.00 
Within subjects 163.88 60 
Blocks 103.94 6 17.32 | 18.23 
Residual 51.30 | 54 .95 


—— мса таана _ 

А sample of 10 subjects is used in the experiment to be described. Each 
Subject is given 28 trials in which to learn a discrimination problem. The 
trials are grouped into blocks of 4 trials each. Тһе block is considered as 
the observational unit. Hence the data are analyzed as if there were 7 
observations on each subject, an observation being the outcome of a series 
of 4 trials. The degrees of freedom for the between-subject variation are 
9; the degrees of freedom for the within-subject variation are 60, 6 degrees 
of freedom for each of the 10 subjects. 

A summary of the over-all analysis appears in Table 4.6-1. For « — .01 
the critical value for a test of the hypothesis that there is no difference in the 
block means is F,,(1,54) = 7.14. The data indicate that there are signif- 
icant differences between the block means. A graph of the block totals is 
given in Fig. 4.2. Inspection of this figure indicates that a straight line 
would provide a good fit to the points, but there is also some evidence to 
indicate that an S-shaped (cubic) curve would provide a better fit. The 
nature of the subject matter also suggests that a cubic curve would be more 
appropriate for these data. Hencea point to be investigated is whether or 
not a cubic curve provides a better fit than a straight line (within the range of 
blocks included in this study). 


TESTS FOR TREND 133 


Toward this end, the mean squares corresponding to the linear and cubic 
trends are computed in Table 4.6-2. Тһе block totals for these experimen- 
tal data are given near the top of Table 4.6-2. Each of these totals is the sum 


Block totals 


3 
Trial blocks 
Figure 4.2 


over 10 observations. Since there are seven blocks, the coefficients corre- 
sponding to the linear, quadratic, and cubictrends are obtained from the set 
of coefficients for which k = 7in Table B.10. The entries under the column 
headed Xc? represent the sums of the squares of the coefficients in the cor- 
responding rows. The entries under the column headed C represent the 


Table 4.6-2 Tests for Trends 
Blocks: 1 2 3 4 5 67 


Block totals: 2 5 7 318 28 30 0633 


Linear —3 2 581 0 1 2 3|28 166 9841" 
Quadratic 5 0 23 24 22206 07/2 84 -2 00 
-] 1| 6-16 427% 


Cubic —1 1 1 0 —1 


OO, ES лм сты шс + 


numerical value of the comparisons. For example, the linear comparison is 


(—3)(2) + (—2)(5) +(-DM) 3 (018) + (1)(28) + (2/31) + (3)(33) = 166. 
е linear comparison is 


The mean square corresponding to th 
Chm _ (166 _ 98.41. 

nXc 1008) 

A test on the significance of the linear trend is given by the F ratio 


MSiin = 


_ MSin _ 98:41 _ 103.59. 
MSres 95 
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The sampling distribution of this statistic (assuming no linear trend) may 
be approximated by an F distribution having degrees of freedom (1,54). 
The linear trend is significant beyond the .01 level. 

For these data the numerical value of the quadratic comparison is zero 
to two decimal places. The mean Square corresponding to the cubic com- 
parison is ШМ ay. _ (C169 _ 54. 

nÈ? 10(6) 
A test on whether or not the cubic trend adds significant predictability 
beyond that already given by the linear and quadratic trends employs the 


statistic ғ Маш 427 4 yg 
MSres 95 


Of this total, 102.68 is accounted for by the linear and cubic trends. The 
remaining between-block variation appears negligible relative to experi- 
mental error. The between-block variation due to higher-order trend 


SSnigner order — SSbiocks E SSun ES SSquaa ғ SSeubic 
= 1.23. 
The corresponding mean square is 


MSnigher order = ES = 41, 


The F ratio in the test for trend components higher than the third is 


ies MShigher order AN Al <1 
MSres EPUM 


Since this ratio is less than unity, the data indicate that no component 
higher than the third is relevant. 


An alternative, more widely used testing procedure for trends is possible. 
In this procedure the error term in the test for linear trend is obtained from 
SSaev ип = SSres + (SSbtocks — SSiin) 

= 51.30 + (103.94 — 98.41) = 56.83. 
The degrees of freedom for SS, tin in this type of design are n(k — 1)—1. 
For the case under Consideration, the degrees of freedom are 60 — 1 — 59. 
The mean square for deviations from linearity is 


SSaev ип 


59 


MSae, lin — 


= .96. 
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Under this testing procedure the statistic used in the test for linear trend is 
MSiin 98.41 
“Елы a 
MSeev ип 96 


The sampling distribution for this statistic may be approximated by ап F 
distribution having degrees of freedom (1,59). This test procedure has a 
slight negative bias; i.e., if F (1,59) is used asa critical value fora test having 
а = .05, this critical value will tend to be slightly larger than the exact critical 
value if the trend is actually different from linear. 


Table 4.6-3 Summary of Alternative Tests for Trend 


Source of variation SS df MS F 
Linear trend 98.41 1 98.41 102,51%% 
Dev from lin 56.83 59 96 
Quadratic trend .00 1 
Dev from quad 56.83 58 
Cubic trend 4.27 1 4.27 4.64* 
Dev from cubic 52.56 57 92 


Under the alternative test procedure, the test for cubic trend uses a 
denominator obtained from 
SSaev cubic = SSres + (SSpiocks — SSin — SSquaa Pe SScubic) 
= 51.30 + (103.94 — 98.41 — .00 — 4.27) = 52.56. 
The degrees of freedom for this source of variation are пк — 1) — 3, which 
inthiscaseis60 — 3 = 57. The mean square corresponding to this source 
of variation is 


a 


SS, 52.5 
MSeev cubic = еа e 7 = 92. 


The test for cubic trend uses the statistic 


zc Mine, ЖШ E pei 
MSeev cubic .92 


The sampling distribution of this statistic may be approximated by an F 
distribution having (1,57) degrees of freedom. The critical value for this 
test (« = .05) is 4.01. Hence the data contradict the hypothesis that the 
cubic trend adds no predictability to the linear and quadratic trends. 

From some points of view this alternative test procedure is to be preferred 
to that which has a constant denominator. Under this procedure, the 
denominator of F ratios tends to err on the side of being too large. When 
the degrees of freedom for MS,,, are large (say, over 30), the first approach 
presented differs only slightly from the alternative approach. 
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4.7 Analysis of Variance for Ranked Data 


Suppose that an experimenter is interested in determining whether or not 
there is any difference between various methods of packaging a product. 
In an experiment, subjects are asked to rank the methods in order of pref- 
erence. A numerical example of this kind of experiment is given in Table 
4.7-1. Person 1, for example, assigned rank 1 to method 3, rank 2 to 
method 2, rank 3 to method 1, and rank 4 to method 4. 


Table 4.7-1 Numerical Example 


Person Method 1 Method 2 Method 3 Method 4 Total 
1 3 2 1 4 10 
2 4 3 1 2 10 
3 2 4 1 3 10 
4 1 3 2 4 10 
5 2 3 1 4 10 
6 1 4 2 3 10 
7 2 3 1 4 10 
8 1 4 2 3 10 
Total 16 26 11 27 80 
ee == мн ннн ААД 
G? 2 
() =z; = 20000 (у= х5Х2-20 (3) = == = = = 222.75 
=P? 800 
(4) — Ape em 200.00 
I ——— Ж —áe 4 00 
SSmethoas = (3) — (1) = 22.75 
55а = (2) — (3) - (4) + (1) = 1725 
55у. peopte = (2) — (4) = 40.00 


The computational formulas in Sec. 4.2 may be used to obtain sums of 
Squares required for the analysis of variance. SSpetween peopte Will always be 
zero. Upon replicating the experiment with random samples of subjects, 
SS, Will remain constant provided that no tied ranks are permitted. 
Rather than an F statistic, the chi-square statistic 

SSmethoas 
(SSmetnoas ES SSres)/n(k z 1) 
= n(k Ens 1)SSmethoas 
98ү. people 
is used to test the hypothesis of no difference in mean rank for the methods. 
The higher the agreement between people in ranking the methods, the larger 
SSnethoas Will be. For the data in Table 4.7-1, 
_ 8(3)(22.75) _ 


За = йук. 


(1) {мы 
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The critical value of this statistic for a .01-level test is 
Lolk — 1) = 53) = 11.3. 


Since the observed y? exceeds the critical value, the data contradict 

the hypothesis of no difference between the mean ranks for the different 

methods of packaging. Inspection of Table 4.7-1 indicates that methods 1 

and 3 have the smaller means and methods 2 and 4 the larger means. 
When no tied ranks are permitted, 


nk(k? — 1 
SS, people = “a А 
and the x? statistic becomes 
12 
2 * = ————__ (277) — 3n(k + 1). 
(2) Xranks ПК 4 1 7) ( ) 


The expression for 7? in (2) is algebraically equivalent to (1) when no ties are 
permitted; (1) may be used whether or not ties are present. For the data 
in Table 4.7-1, expression (2) gives 


See 7. 1782) — 3(8)(5 
perry rt 20 
= 13.65. 


Use of this statistic in testing the hypothesis on the mean ranks defines what 


is called the Friedman test. н à 4%. 
An index of the extent to which people agree in their preferences is given 


by the coefficient of concordance, which is defined as 
SSmethoas 
КҮК Н 
This coefficient is related to the average intercorrelation between the rank- 
ings assigned by the people; this relationship is 
п = 1 


(3) W= 


(4) R= 


esis that the coefficient 


i d as a test of the hypoth 
The test in (1) may be regarded as a te yp тА 


of concordance in the population of people is zero. 


4.7-1 

; 22.75 

W=—— = .569. 
40.00 


The average intercorrelation between the people is 
80569) — 1 «оу, 
8—1 
The index W corresponds to a correlation ratio, whereas 7 corresponds to a 


product-moment correlation (or equivalently a rank-difference correlation). 


Rc 
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The average intercorrelation between the people may also be computed by 
-means of relationship (6^) in Sec. 4.5, provided that л is used in place of К, 
In terms of this relationship, 


; MSvetween methods — MSres 
MSbetween methods = (n == 1)MSres 
7.583 — .821 = 507. 


~ 7583 + 7(.821) 
4.8 Dichotomous Data 


Observed data may in some cases be classified into one of two classes; 
for example, a characteristic may be present or absent, a response is either 
yes or no, a drug is either fatal or itis not. Such data are said to be dichoto- 
mous. One of the two dichotomies may conveniently be designated by a 
0, the other bya 1. Which category is assigned the zero is arbitrary, 


Table 4.8-1 Numerical Example 
EE eee 


Subject Time 1 Time 2 Time 3 Time 4 Time 5 Total 
1 0 0 0 0 0 0=P, 
2 0 0 1 1 0 2=P, 
3 0 0 1 1 1 3 
4 0 1 1 1 1 4 
5 0 0 0 0 1 1 
6 0 1 0 1 1 3 
7 0 0 1 1 1 3 
8 1 0 0 1 1 3 
9 1 1 1 1 1 ә 

10 1 Ie 1 1 1 1 5 

Total 3 4 6 8 gm 

23-8 = = ы = = 
(1) fmm 16.82 (2) = IEX: 29 (3) = E = 18.90 
2 


ХР? 
(Dc сек Оо 


k 5 н 
SShetween people = (4) — (1) = 4.58 df 


55, people = (2) - (4) = 7.60 
SStime = (3) — (1) = 2.08 4 MStime = .520 
SSrog =(2) – (3) – (4 + (1) = 5.52 36 MS; = .153 
SStotal = (2) — (1) = 12.18 


Consider an experiment designed to study the effects of an advertising 
campaign upon the attitude of a potential population of buyers toward the 
product. The data may take the form given in Table 4.8-[. In this table 
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for the study. Each subject is interviewed at the end of five different time 
periods. For example, subject 1 was not favorable at any time; subject 5 
was not favorable on the first four interviews but was favorable on the fifth 
interview. 

To test the hypothesis of no change in the percentage of favorable replies 
over the time periods, the statistic 


n(k — 1)SStime 
SSw, people 


(1) Qi: 


may be used. The Q statistic has the same form as 72, „кь, Which was used 
inSec.4.7. Under the hypothesis of no change in the percentage of favor- 
able replies, Cochran (1950) has shown that the sampling distribution of 
the Q statistic is approximated by a chi-square distribution with k — 1 
degrees of freedom, when n is reasonably large. For an a-level test the 
critical value for the О statistic is y? „(Е — 1). 

For the data in Table 4.8-1, 


0195 = DCW _ 1995, 
7.60 


The critical value for a .05-level test is 5,(5 — 1) = 9.5. Hence the experi- 
mental data contradict the hypothesis of no change in the percentage of 
favorable replies. Examination of the data indicates a systematic increase 
in the percentage of favorable replies. Cochran (1950) has indicated that 
the F statistic computed by treating the data as if the measurements were 
normally distributed variables will yield probability statements which are 
relatively close to those obtained by use of the Q statistic. For the data in 


Table 4.8-1, p — MSume _ 520 _ 340, 
MS "153 


For a .05-level test the critical value for the F statistic is F (4,36) = 2.63. 
Thus use of the F statistic also leads to the rejection of the hypothesis of no 
change in the percentage of favorable replies. 


CHAPTER 5 


Design and Analysis 
of Factorial Experiments 


5.1 General Purpose 


Factorial experiments permit the experimenter to evaluate the combined 
effect of two or more experimental variables when used simultaneously. 
Information obtained from factorial experiments is more complete than 
that obtained from a series of single-factor experiments, in the sense that 
factorial experiments permit the evaluation of interaction effects. An inter- 
action effect is an effect attributable to the combination of variables 
above and beyond that which can be predicted from the variables 
considered singly. 

For example, many of the properties of the chemical substance H,O 
(water) cannot be predicted from the properties of oxygen and the properties 
of hydrogen studied in isolation. Most of the properties of water are attrib- 
utable to the effect of the interaction between oxygen and hydrogen. The 
compound formed by this interaction has properties which are not given by 
simply adding the properties of oxygen to the properties of hydrogen. 

At the end of a factorial experiment, the experimenter has information 
Which permits him to make decisions which have а broad range of appli- 
cability. In addition to information about how the experimental variables 
operate in relative isolation, the experimenter can predict what will happen 
when two or more variables are used in combination. Apart from the 
information about interactions, the estimates of the effects of the individual 
variables is, in a sense, a more practically useful one; these estimates are 
obtained by averaging over a relatively broad range of other relevant 
experimental variables. By contrast, in a single-factor experiment some 


randomized. In the case of a factorial experiment, the population to which 
inferences can be made is more inclusive than the corresponding population 


140 
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a sharp distinction must be drawn between experiments involving repeated 
measures on the same elements and those which do not involve repeated 
measures. The material in this chapter will be concerned primarily with 
experiments which do not involve repeated measures, However, many of 
the basic principles to be developed in this chapter will be applicable, with 
only slight modification, to the case in which there are repeated measures, 

The term factor will be used in a broad sense. For some purposes a 
distinction will be made between treatment and classification factors. The 
latter group the experimental units into classes which are homogeneous 
with respect to what is being classified. In contrast, treatment factors 
define the experimental conditions applied to an experimental unit. The 
administration of the treatment factors is under the direct control of the 
experimenter, whereas classification factors are not, іп а sense. The effects 
of the treatment factors are of primary interest to the experimenter, whereas 
classification factors are included in an experiment to reduce experimental 
error and clarify interpretation of the effects of the treatment factors. 

The design of factorial experiments is concerned with answering the 
following questions: 

1. What factors should be included ? 

2. How many levels of each factor should be included ? 

3. How should the levels of the factors be spaced ? 

4, How should the experimental units be selected ? 

5. How many experimental units should be selected for each treatment 
combination ? 

6. What steps should be taken to control experimental error? 

7. What criterion measures should be used to evaluate the effects of the 
treatment factors? 

8. Can the effects of primary interest be estimated adequately from the 
experimental data that will be obtained ? Seats 

The answers to these questions will be considered in some detail in this 
chapter and the chapters that follow. An excellent and readable overview 
of the planning of factorial experiments will be found in Cox (1958, chap. 7). 


5.2 Terminology and Notation 


The term factor will be used interchangeably with the terms treatment and 
experimental variable. More specifically, a factor is a series of related 
treatments or related classifications. The related treatments making up a 
factor constitute the /evels of that factor. For example, a factor color may 
consist of three levels: red, green, and yellow. А factor size may consist of 
two levels: small and large. A factor dosage may consist of four levels: 
1 сс,3 сс,5 сс,ап47 cc. The number of levels within a factor is determined 
largely by the thoroughness with which an experimenter desires to investi- 
gatethefactor. Alternatively, the levels of a factor are determined by the 
kind of inferences the experimenter desires to make upon conclusion of the 
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experiment. The levels of a factor may be quantitative variations in an 
essentially quantitative variable, or they may be qualitatively different 
categories within an essentially qualitative variable. Basically a factor is a 
qualitative variable; in special cases it becomes a quantitative variable. 
The dimensions of a factorial experiment are indicated by the number of 
factors and the number of levels of each factor. For example, a factorial 
experiment in which there are two factors, one having three levels and the 
other having four levels, is called a 3 x 4 (read "three by four") factorial 
experiment. Ina2 x 3 x 5 factorial experiment, there are three factors, 
having respective levels of two, three, and five. (Many different designs 
may be constructed for a given factorial experiment.) The treatment com- 
binations ina 2 x 3 factorial experiment may be represented schematically 


as follows: 
Levels of factor B 
| Балы 2% 
а, | abı аба абу 
Levels of factor A 


a| aby аб abs 


In this schema, a, and a, designate the levels of factor А; b,, bs, and b; 
designate the levels of factor B. Ina2 x 3 factorial experiment six possible 
combinations of treatments may be formed. Level a, may be used in com- 
bination with each of the three levels of factor B; level a, may also be used in 
combination with each of the three levels of factor B, The resulting treat- 
ment combinations are labeled in the cells of the schema. For example, 
the symbol ab,, represents the experimental condition resulting when factor 
Ais at level a, and factor B is at level bs. 

‚ For the case of a p x д factorial experiment, pq different treatment com- 
binations are possible. Ina P X q X r factorial experiment there are pqr 
treatment combinations. As the number of factors increases, or as the 
number of levels within a factor increases, the number of treatment com- 
binations in a factorial experiment increases quite rapidly. For example, 
a5 x 5 x 5 factorial experiment has 125 treatment combinations. 

In an experiment, the elements observed under each of the treatment 
combinations will generally bearandom sample from some specified popula- 
tion. This specified population will, in most cases of interest, contain a 
potentially infinite number of elements. If n elements are to be observed 
under each treatment combination in a P X q factorial experiment, a ran- 
dom sample of npq elements from the population is required (assum- 
Ing no repeated measurements on the same elements). The npq elements 
are then subdivided at random into pq subsamples of size n each. These 
subsamples are then assigned at random to the treatment combinations. 

The potential (or population) levels of factor 4 will be designated by the 
symbols a, а,..., аһ...,ар. The number of such potential levels, P, 
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may be quite large. The experimenter may group the Р potential levels into 
p levels(p < P) by either combining adjoining levels or deliberately select- 
ing what are considered to be representative levels. For example, if factor 
A represents the dimension of age, the experimenter may choose to group 
the levels into 1-year intervals; alternatively, the experimenter may choose 
to group the levels into 2-year intervals. On the other hand, if factor A 
represents a dosage dimension, the experimenter may deliberately select a 
series of representative dosages; i.e., in terms of previous research the levels 
selected may be representative of low, middle, and high dosages. 

When p, the number of levels of factor A included in the experiment, is 
equal to P, then factor A is called a fixed factor. When the selection of the 
p levels from the potential P levels is determined by some systematic, non- 
random procedure, then factor A is also considered a fixed factor. In this 
latter case, the selection procedure reduces the potential P levels to p effective 
levels. Under this type of selection procedure, the effective, potential 
number of levels of factor A in the population may be designated Ре, 
and Portective = P. 

In contrast to this systematic selection procedure, if the p levels of factor 
A included in the experiment represent a random sample from the potential 
P levels, then factor A is considered to be a random factor. For example, in 
an experiment designed to test the effectiveness of various drugs upon cate- 
gories of patients, the factor 4 may be the hospitals in which the patients are 
located. Potentially the number of different hospitals may be quite large. 
If a random sample of p of the P potential hospitals are included in the 
experiment, then factor A is a random factor. (If, further, the sampling 
within each of the hospitals selected is proportional to the number of 
patients within the hospital, then conclusions drawn from this kind of 
experiment will be relevant to the domain of all patients and not just to the 
domain of all hospitals. In most practical situations in which random 
factors are encountered, p is quite small relative to P, and the ratio p/P is 
quite close to zero. j 

Similar definitions apply to factor B. Let the potential levels of factor B 
be designated b,, bz, ..., by» ..., Во. Of these Q potential levels, let the 
number of levels actually in an experiment be q. If g= Q, or if the 
effective number of levels of factor B is reduced from Q to q by some syste- 
matic, nonrandom procedure, then factor B is considered fixed, (The re- 
duction from Q potential levels to q effective levels is a function of the 
experimental design.) The actual levels of factor B included in an experi- 
ment will be designated by the notation bı, Б,..., bj os +s ba If the q 
levels inan experiment area random sample from the Q potential levels, then 
factor B is considered a random factor. In most practical cases in which 
factor B is a random factor, Q will be quite large relative to q, and the ratio 


q/Q will be close to zero. - . 
The ratio of the number of levels of a factor in an experiment to the 
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potential number of levels in the population is called the sampling fraction 
бог а factor. In terms of this sampling fraction, the definitions of fixed and 
random factors may be summarized as follows: 


Sampling fraction Factor 
pip =1 or PlPeftective = 1 A is a fixed factor 
РІР =0 A is a random factor 
410 —1 Or = q/Qerrective = 1 B is a fixed factor 
4/0 +0 B is a random factor 


Cases in which the sampling fraction assumes a value between 0 and 1 do 
оссиг in practice. However, cases in which the sampling fraction is either 
1 or very close to 0 are encountered more frequently. 

The symbol a; will be used to refer to an arbitrary level of factor 4 when 
the frame of reference is the potential P levels of factor А. The symbol a; 
will be used to refer to an arbitrary level of factor A when the frame of 
reference is the p levels actually included in the experiment. A similar 
distinction will be made between the symbols 5; and 5, The first symbol 
has as its frame of reference the potential Q levels of factor B, whereas the 
second symbol has as its frame of reference the 4 levels of factor В in the 
actual experiment. In those cases in which factor A is a random factor, the 
symbol а), when the frame of reference is the potential population levels, 
refers to a specified population level. However, when the frame of refer- 
ence is the experiment, the symbol a, refers to a specified level in the experi- 
ment. Thus the symbol a, may refer to different levels of the same factor, 
depending upon the frame of reference. The context in which the notation 
is used will clarify the ambiguity of the symbol considered in isolation. 

To illustrate additional notation, assume that all the potential elements are 
included in all the potential cells of a factorial experiment. (Generally only 
a random sample of the potential elements are included in a cell of a factorial 
experiment.) Assume further that, after the treatment combinations have 
been administered, each of the elements is measured (observed) on the 
characteristic being studied (the criterion or dependent variable). The 
mean of the observations made under each treatment combination will be 
denoted by the following notation: 


by by by bo 
a 1 Hiz OE Mg Wen IQ 
az May Hoz терү, а 2450, 
27 Hn Ша уы ейін тита 
ар Арі Ир» кыру Mets PURO 
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In this notation, иу denotes the mean on the criterion for the potential 
population of elements under treatment combination ab;;. Equivalently, 
шу represents the population mean for the dependent variable in cell абу. 
It will be assumed that the potential number of elements in each of the cells 
is the same for all cells. 

The average of the cell means appearing in row J is 


Уш» 
2j 


Ш. = Q . 


In words, иу. is the mean of the dependent variable averaged over all poten- 
tial treatment combinations in which factor A is at level ау. Similarly, the 
mean of the dependent variable averaged over all potential treatment com- 
binations in which factor B is at level b; is 

dew 

1 


Bg = pu 


The grand mean on the dependent variable is 


a 22ers T Хи. Bes 


PQ Р Q 

The grand mean may also be defined as the mean of the dependent variable 
for all potential observations under all potential treatment combinations. 
Notation for the statistics obtained from actual experimental data is as 
follows. (It is assumed that each cell mean is based upon an independent 


random sample of size n.) 


by b vt 5 ven Be 
a АВ АВ!» АВ}; АВ, n 
a, | А8 АВ, АВ АВ, i, 
а АВ, АВ» АВ) АВ, 4; 
ау АВ, АВ, АВ, АВ, 4, 
В, B, B, B, б 


The symbol АВ, represents 


ent variable for the elemen 


the mean of the measurements on thedepend- 


average of all observations at level а; is 


4 


Ya, 


V RE eds 


ts under treatment combination а), The 
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Similarly, the average of all observations at level b; is 


528, 
5-3 


P 
The grand mean G is the mean of all means. Thus 


Pq p q 


In most designs for factorial experiments, these statistics are unbiased 
estimates of corresponding parameters. That is, 


Е(АВ,) = щь Е(Д) = ш, and E(B) =p. 
5.3 Main Effects 


Main effects are defined in terms of parameters. Direct estimates of 
these parameters will, in most cases, be obtainable for corresponding statis- 
tics, The main effect of level a, of factor A is by definition 


Ur inmate 
In words, the main effect for level a, is the difference between the mean of all 
potential observations on the dependent variable at level a, and the grand 
mean of all potential observations. The main effect for level a; may be 


either positive or negative. The main effect for level ar, where I’ designates 
a level of factor A different from Г, is 


Kreis 
For most practical purposes, only the difference between two main effects 
will be needed. The differential main effect is defined to be 
Г fy, Ву 


(In some experiments, only the differential main effects can be estimated 
from the observed data.) 


Analogous definitions hold for the main effects of the levels of factor B. 
Thus, the main effect for level b, is 


By = иу — А... 
The differential main effect for levels b запа b; is 


By = By =m — [Eg 
A differential main effect measures the extent to which criterion means for 
two levels within the same factor differ. Thus, if all the means for the 
various levels of a factor are equal, all the differential main effects for the 
levels of that factor will be zero. It should be pointed out that equality of 
the population means does not imply equality of the sample main effects. 
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Hence estimates of differential main effects may not be zero even though the 
population values are zero. However, when the population differential 
main effects are zero, their sample estimates will differ from zero by amounts 
which are functions of experimental error. 

The variance of the main effects due to factor A is, by definition, 


Ха-ал УФ 
Р-1  P=1 
An equivalent definition in terms of the differential main effects is 
ae X(x; — ay)? : 
P(P — 1) 
The symbol J < /' indicates that the summation is over different pairs of 
«’s; that is, ау — % is not considered to be different from о, — о. The 
variance of the main effects of factor 4 measures the extent to which the 
criterion means for the various levels of factor A differ. Equivalently, 
the variance of the main effects may be regarded as an over-all measure 
of the differential main effects for that factor. Thus, o; will be small when 
all the differential main effects are small; o2 will be large when one or 
more of the differential main effects are large. When all the џи; are equal, o; 


will be zero. 
Analogous definitions hold for the variance of the main effects of factor B. 


Susu) ОВ 
PECES ATEM. DER НА 


f= 


(І < Г). 


2 


ор = осі = Q-1 ; 
In terms of the differential main effects, 
dbs — Br? 
о= =” , O<). 
Q(Q — 1) 


To illustrate the definitions of main effects and their variance, suppose the 
population means on the dependent (criterion) variable are those given in 
the following table. Here P = 2and Q = 3. 


b by bs Mean 
ау 10 5 15 10 
а 20 5 5 10 
Mean 15 5) 10 10 
For the data in this table, ш = 10, м, = 10, and џи, = 10. Hence 
ад = 0,ап4оҙ = 0. Itisalso notedthat и = 15,45 =5, из = 10. Hence 


B, =15—10=5, 
B= 5—10=—5, 
= 10 — 10 — 0. 
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The differential main effects for factor B are 
В, Ba = 10, P Bs 25 Bs Ёз 3: 


[In the general case there will be Q(Q — 1)/2 distinct differential main effects 
for factor B.] 
The variance due to the main effects of factor B is 


Ps 52 + (—5)? + 0° 
Cnt n 


25. 


In terms of the differential main effect of factor B, the variance is 

2 XB, ra By? 

0(0- 1) 

2; 10? + 5? + (—5)? 
38 — 1) 


= 25. 


In summary, main effects as well as differential main effects are defined in 
terms of a specified population and not in terms of individual treatments 
within the population. Thus a main effect is not a parameter associated 
with a specified level of a specified factor; rather, a main effect depends upon 
all the other factors that may be present, as well as the number of levels 
assumed for the specified factor. It should also be noted that main effects 
are computed from means which are obtained by averaging over the totality 
of all other factors present in the population to which inferences are to be 
made. 


5.4 Interaction Effects 


The interaction between level a, and level Б, designated by the symbol 
ар,» is a measure of the extent to which the criterion mean for treatment 
combination ab; , cannot be predicted from the sum of the corresponding 
main effects. From many points of view, interaction is a measure of the 
nonadditivity of the main effects. To some extent the existence or non- 
existence of interaction depends upon the scale of measurement. For 
example, in terms of a logarithmic scale of measurement, interaction may 
not be present, whereas in terms of some other scale of measurement an 
interaction effect may be present. The choice оға scale of measurement for 
the dependent variable is generally at the discretion of the experimenter. If 
alternative choices are available, then that scale which leads to the simplest 
additive model will generally provide the most complete and adequate 
summary of the experimental data. 

In terms of the population means and the population main effects, 


abis = иту — (x + By + ш). 
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From the definition of the main effects, the above expression may be shown 
to be algebraically equivalent to 
«Ву = Шу — Ш. — Шу es 
From the way in which ш; and уи were defined, it follows that 


Sobry = Xu; — Хш. — Des + Ўр. 
1 1 1 1 1 
= Риу — Ри. — Pug + Ри, 


= 
It also follows that 


Хо) = 0. 
7 


In words, the sum of the interaction effects within any row or any column of 


the population cells is zero. 
Differential interaction effects are defined by the following examples: 


(i) ар» — a, = Mis — Иза (a4 — %3) — (Pa — Ёа), 
(ii) «В — о а = Шз — Has — (Pa Вз), 
(iii) ара — «зз = з — Haz (01 — 23). 


In (i) the differential interaction effect depends upon two differential main 
effects as well as the difference between cell means. In (ii) and (iii) only a 
single differential main effect is involved; (ii) depends upon the differential 
main effect associated with two levels of factor B, whereas (iii) depends upon 
a differential main effect associated with two levels of factor A. Relative 
to (i), the differential interaction effects represented by (ii) and (iii) are 
classified as simple interaction effects. 

The variance due to interaction effects in the population is, by definition, 


Qni XX); 1 
ы 
An equivalent definition can also be given in terms of the differential inter- 
action effects. Under the hypothesis that the individual cell means may be 
predicted from a knowledge of corresponding main effects, оз, = 0; that is, 
under the hypothesis of additivity of main effects, the variance due to the 
interaction effect is equal to zero. 
For the numerical data given in the last section, 

бы = 10 — 10 — 15 + 10 = —5, 

apa = 5—10— 5+10=0, 

aps = 15 — 10 — 10 + 10 — 5. 


It is also noted that afi; = —5 + 0 + 5 = 0. 
7 
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5.5 Experimental Error and Its Estimation 


All uncontrolled sources of variance influencing an observation under a 
specified treatment combination contribute to what is known as the variance 
due to experimentalerror. The greater the number of relevant sources of 
variance which are controlled and measured, the smaller the variance due to 
experimental error. In a population in which there are N potential 
Observations (i.e., measurements on a criterion) under a specified treatment 
combination, the variance of these observations defines the variance due to 
experimental error. Since all the observations within this cell have been 
made under the same treatment combination, effects associated with the 
treatment per se have been held constant. Hence the treatment effects 
per se are not contributing to the within-cell variance. Differences between 
units of the experimental material existing prior to the experimental treat- 
ment, variance introduced by inaccuracies or uncontrolled changes in 
experimental techniques, possible unique interactions between the material 
and the treatments—all these sources contribute to the within-cell variance. 

If an observation on element K under treatment combination ab;, is 
designated by the symbol Хуу, then the variance of the М potential ele- 
ments under this treatment combination is given by 


ЖХик = шу)? 
М-і 


2 
т. зе 


[In the usual definition of a population variance, N rather than N — 1 
appears as the divisor. However, using N — 1 as the divisor simplifies the 
notation that follows. This definition of олу is the one adopted by Corn- 
field and Tukey (1956, р. 916). As N approaches infinity, N — 1 ap- 
proaches N.] Thus, o? ә» the within-cell variance for the population cell ab; ;, 
is the variance due to experimental error in this cell. For purposes of 
making tests of significance and obtaining confidence bounds on parameters, 
it must be assumed that the variance due to experimental error is constant 
for all cells in the experiment. This is the assumption of homogeneity of 
error variance; this assumption may be represented symbolically by 


91; = 02 ога IJ’s, 


where o? is the variance due to experimental error within any cell in the 
population. For the same purposes, it will generally be assumed that X; IK 
1s normally distributed within each of the cells. 

Inan experiment, assume that there are n independent observations within 
each of the treatment combinations included in the experiment. The 
variance of the observations in the cell ab;, is given by 


Хх.» - АВ, 


п—1 


2 
Sij 


ESTIMATION OF MEAN SQUARES 151 


Assuming that the experimental error in the population is o; for all cells, 
s provides an estimate of оў. A better estimate of о? is obtained by 
averaging the within-cell variances for each of the pq cells in the experiment. 
This average within-cell variance, denoted by the symbol 52 ооа» May be 
represented as 
on Zxsh 
pooled = ~g 

Under the assumptions made, 5% Will be an unbiased estimate of o?. 
Further, the sampling distribution of pq(n — 1)Sootea/% may be approxi- 
mated by a chi-square distribution having pq(n — 1) degrees of freedom. 
[Since the variance within each of the pq cells in the experiment is based upon 
n independent observations, each of the within-cell variances has n — 1 
degrees of freedom. The pooled variance, 52 ооа» has degrees of freedom 
equal to the combined degrees of freedom for each of the within-cell 
variance, that is, pq(n — 1).] Because the observations within each of the 
cells are independent, the degrees of freedom for each of the variances 
that are averaged are additive. 

The statistic 2а is called the within-cell mean square, abbreviated 
MS, ,,. Alternatively, this source of variance is designated as MSerror in 
designs which do not have repeated measures. 


5.6 Estimation of Mean Squares Due to Main Effects and 
Interaction Effects 


Some of the assumptions that will be made for purposes ofestimation and 
analysis in a two-factor experiment are summarized by the following struc- 


tural model: 
(1) Xim =н. + ч + By + Abia + tae 


In this model, Хуу, is an observation made in the experiment on element k 
under treatment combination ab; On the right-hand side of (1) are the 
factorial effects and the experimental error. This model assumes that the 
factorial effects as well as the experimental error are additive, i.e., that an 
observation is a linear function of the factorial effects and the experimental 
error. Expression (1) is a special case of the general linear hypothesis. 
Each of the terms on the right-hand side is assumed to be independent of the 
others. 

The left-hand side of (1) represents an observation on the dependent 
variable made in the experiment. The terms on the right-hand side of (1) 
are parameters of the independent variables which account for the dif- 
ferences between observations. The terms on the right-hand side cannot 
be observed directly; however, data from the experiment will generally 
permit the experimenter to estimate certain linear functions of these effects 


and test hypotheses about such functions. 
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For all observations made under treatment combination ab,,, the terms 
о, Bj, and «f, are constant. (и.. is a constant for all observations.) 
Hence the only source of variation for observations made under treatment 
combination ab,; is that due to experimental error. If 55; is the variance of 
the observations within cell a5;;, then з? is an estimate of o2. 

In terms of (1), the mean of the observations within cell ab,;may be ex- 
pressed as 

ЖЕ ХХ,» 
(2) AB;; = = г. + + Bs «В, d 85. 


The notation ғ,, denotes the mean experimental error for the n observations 
in cell ab,; If the observations in cell аб, were to be replicated with an 
independent sample of size n, the term £j; would not remain constant. 


Table 5.6-1 Structural Parameters Estimated by Various Statistics 


Statistic Structural parameters estimated 
A, — Ay (n; — а) + (ай; — ар) + (E; — &) 
is BE (B; — By) + (28, = apy) + (& — &) 
АВ — АВ, (0: — ai) + (8, — By) + («б — «Вгу) + (5; — &y) 
(AB; — A; — B) — 


(АВ, — Ay — Bj) (Bi; — өй) + (8 — Epy) 


However, all the other terms on the right-hand side of (2) would remain 
constant for this kind of replication. Thus, for a large number of replica- 
tions of the observations within cell ab,,, each replication with an independ- 
ent sample of size л, 


(3) Е(АВ,,) = E(u.) + E(a,) + E(B,) + Е(ар,) + E(&) 
= 4. + a+ By + аб, + 0. 


In this context, the expected value of terms on the right-hand side is the 
average value of a large number of replications for cell ab,; with independent 


samples. Although the mean experimental error ғ, has expected value 
equal to zero, it varies from one replication to thenext. Hence the variance 
of 2,; is not zero. 

From (1) and the definitions of the various parameters, the statistics sum- 
marized in Table 5.6-1 have the expressions indicated. The right-hand side 
of this table gives the structural variables estimated by the statistics on the 
left-hand side. 


The notation ай, denotes the average for level a, of the о), effects over the 
levels of factor B included in the experiment. Ifg = Q (that is, if factor B 


is fixed), then p is zero. If 4% О, ap, need not be zero for any single 
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experiment. To illustrate this point, suppose that there are six levels of 
factor B in the population of such levels, and suppose that the interaction 
effects associated with level a; are those given below: 


b, b, bs b, bs bg 


а af —3 аба-2 оба=—3 afa = —2 96-5 а= —S 
In this case О = 6. Note that >=, = 0. Suppose that only a random 
J 


sample of q = 3 levels of factor В is included in any single experiment. 
Suppose that the levels bs, by, and b; are included in an experiment which is 
actually conducted. For this experiment 
af, 2%, аа + «Ва + «б Dy 
й 4 3 3 

For a large number of random replications of this experiment, where ап 
independent random sample of the levels of factor B is drawn for each 
replication, the expected value of op; will be zero, The variance of the 
distribution of «f, generated by this kind of sampling procedure depends 
upon n, q, Q, and the variance 025. 

The notation 28, denotes the average of the а); effects over the levels of 
factor A present in an experiment at level b, of factor B. If factor A is а 
fixed factor, of, = 0. However, if factor A is not a fixed factor, aj; is 
not necessarily equal to zero for any single experiment. Over a large 
number of random replications of the experiment, the expected value of 


ap, will be equal to zero. If factor A is a fixed factor, 9, will be zero, since 
afi, will be zero for all replications. If factor 4 is a random factor, 
©з, will be a function of n, p, P, and озу. 

The mean square due to the main effects of factor A in the experiment is 
defined to be 


diio s 2 
MS, — тх — б) » 
p-1 
An equivalent definition in terms of differences between pairs of means is 
ДАЕТ) 
Ms, = М 4 
pp — 1) 
The summation in this last expression is with respect to all distinct pairs of 


means, no pair being included twice. The multiplier nq is the number of 
observations in each А. The expected value of A; — A, (for independent, 


random replications of the experiment) is 
E(A, — Ay) = E(u: — a) + EGP; — Be) + E, — &) 
+0 +0. 


= 6; 
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The expected value of MS, may be shown to be 
E(MS,) = (=) 4+ ( gi 4 ngos. 


Detailed steps in the derivation of this latter expected value are given by 
Cornfield and Tukey (1956). 
The mean square due to factor B is defined to be 

ME npX(B; — Gy 

q—1 
— npX(B, — В, 

alq — 1) 
The multiplier np represents the number of observations in each B, The 
expected value of this mean square in terms of the parameters in (1) is given 
in Table 5.6-2. 

Table 5.6-2 Expected Values of Mean Squares 


MS, 


Mean square as obtained Expected value of mean square in 


from experimental data terms of parameters of (1) 
MS, (1 — n[N)o? + n(1 — 4/0)02, + ngo? 
MS, (1 — n/N)o? + n(1 — р[Р)ойв + проф 
М8, (1 — n[N)c? + по? з 
MSerror (1 — n/N )o? 


The mean square due to interaction effects in the experiment is defined as 
MS, = nZX(AB,, — Д, — B, + e 
(p — 1)(q — 1) 

An equivalent definition can be given in terms of differential interaction 
effects. The multiplier л is the number of Observations in each АВ,,. Тһе 
expected value of this mean Square represents the average of the MS,, com- 
puted from a large number of independent, random replications of the 
experiment; this average value is expressed in terms of the parameters in the 
general linear model (1), which is assumed to represent the sources of 
variance underlying an observation. 

Certain of the coefficients in Table 5.6-2 are either zero or unity, depend- 
ing upon whether a factor is fixed or random. It will generally be assumed 
that the number of elements observed in an experiment is small relative to 
the number of potential elements in the population of elements, i.e., that 
the ratio n/N for all practical purposes is equal to zero. Hence the co- 
efficient 1 — n/N is assumed to be equal to unity. 

If factor A is fixed, the ratio p/P will be equal to unity and the coefficient 
1 — p/P will be equal to zero. If, on the other hand, factor 4 is random, 


ESTIMATION OF MEAN SQUARES 155 


the ratio p/P will be equal to zero and the coefficient 1 — p/P will be equal 
tounity. Inan analogous manner, the coefficient 1 — q/Q is equal to unity 
when factor B is a fixed factor and equal to zero when factor B is a random 
factor. 

Special cases of the expected values of the mean squares are summarized 
in Table 5.6-3. Each of these cases is obtained from the general values 
given in Table 5.6-2 by evaluating the coefficients which depend upon the 
ratios n/N, p/P, and 4/0. Several different approaches may be used to 
obtain the special cases given in Table 5.6-3. Specialization of the general- 
ized approach represented by Table 5.6-2 provides the simplest method of 
attack on the evaluation of more complex experimental designs. 


Table 5.6-3 Special Cases of Expected Values of Mean Squares 
ee ESS ———— 


Case 1 Case 2 Case 3 
Mean squares Factor A fixed Factor A fixed Factor A random 
Factor B fixed Factor B random Factor B random 
MS, o? + nqoi oè + nol + ngoz a? + no; + ngog 
MS, о? + проў в? + проф oè + noga + проў 
MS, o + по? оў + поё з o + поё р 


2 2 
MSerror 9; | 9; 17% 


Case 1, in which both factors аге fixed, has been designated by Eisenhart 
(1947) as model I. Case 2, in which one factor is fixed and the second is 
random, is called the mixed model. Case 3, in which both factors are 
random, is called model III, or the variance components model. Model I 
has been more extensively studied than the other two models. In its most 
general form, the statistical principles underlying model I are identical to 
those underlying the general regression model having any number of fixed 
variates and one random variate. As such, the best estimates of various 
parameters can readily be obtained by the method of least squares. For the 
case of the generalized model I, application of the method of least squares 
is straightforward and leads to no difficulties. For the generalized mixed 
model, application of the principles of maximum likelihood are more direct. 

Since the statistical tests made on the experimental data depend upon 
what these expected values are assumed to be, it is particularly important 
to specify the conditions under which these expected values аге derived. То 
obtain the general expected values given in Table 5.6-2, the following 
assumptions are made: y 

1. There is a population of size Р and variance o? of main effects of factor 
A, of which the effects (жу, о, . . - , %,) occurring in the experiment constitute 
a random sample (sampling without replacement) of size p. The sample 
may include all the levels of factor A in the population; that is, p may be 
equal to P. 
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2. There is a population of size Q and variance о) of main effects of factor 
B, of which the effects (0), Basix cuf) occurring in the experiment constitute 
arandom sample of sizeg. The sample may include all the levels of factor 
B in the population; that is, q may be equal to Q. 

3. There is a population of interaction effects of size PQ and variance 
azp; the «B,,’s which occur in the experiment correspond to the combinations 
of the levels of factor 4 and factor B that occur in the experiment. That is, 
one does not have a random sample of pq interaction effects; rather, the 
interaction effects in the experiment are tied to the levels of factor A and 
factor B that occur in the experiment. It is assumed that the average (in 
the population) of the interaction effects over all levels of one factor is 
independent of the main effects of the other factor; that is, x, is independent 
of a, and af), is independent of 8,. 

4. The sampling of the levels of factor A is independent of the sampling 
of the levels of factor В. 

5. The experimental error is independent of all main effects and all inter- 
actions. Further, within each cell in the population, e, the experimental 
error, is assumed to be normally distributed, with mean equal to zero and 
variance equal to о? for all cells in the population. 

6. The л observations within each cell of the experiment constitute a 
random sample of size n from a population of size N (assumed infinite in 
most cases). The л observations within each cell constitute independent 
random samples from a random sample of npg independent elements drawn 
from the basic population. 

For purposes of deriving the expected values of mean squares, some of 
these assumptions may be relaxed. The assumption of normality of the 
distribution of the experimental error is not required for the derivation of the 
expected values. However, all these assumptions are needed for the valid- 
ity of the tests involving the use of F ratios, which are based upon the 
expected values of the mean squares. In particular, the assumption that 
the distribution of the experimental error is normal is required in order 
that the sampling distributions of mean squares be chi-square distribu- 
tions. 

Under the conditions that have been stated, the mean squares computed 
in the analysis of variance have the following sampling distributions: 


Statistic Sampling distribution 
(p — 1)MS,/E(MS,) Chi square with p — 1 df 
(4 — 1)MS,/E(MS,) Chi square with g — 1 df 
(p — 1)(4 — 1)MS,,/E(MS,,) Chi square with (p — 1)(q — 1) df 


Principles Underlying Derivation of Expected Values for Mean Squares. 
To provide some insight into the principles underlying the derivation of the 
expected values for mean Squares, a nonrigorous derivation will be outlined. 
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From finite sampling theory, one has the following theorem: 
N — пох 
М—1п' 
Under random sampling without replacement after each draw, (1) relates 
the square of the standard error of a mean, o$, to the population size N, the 


sample size n, and the variance o of the variable X in the population. 
In (1), the population variance is defined to be 


(1) оу = 


X(X — uy 
2 = H 
Ox N 
If one uses as the definition of the population variance 
2 XX – и)? 
T м-1° 
then (1) has the form 
n \ ох 
: 5-0-9 


To simplify the notation, the present development will define а population 
variance using № — 1 as the divisor. This definition is consistent with that 
used by Cornfield and Tukey (1956). КУ a: 

In terms of the right-hand side of the structural model given in (1) of the 
last section, the mean of all observations made at level a; in an experiment is 


(3) А, = и. t +В +B +. 
In this notation Ё represents the average effect of all levels of factor B 
included in the experiment; B is constant for all levels of factor А. The 
notation ap, represents the average interaction effect associated with level a,; 
аф; may differ for the various levels of factor А. Тһе notation ғ; denotes 
the average experimental error associated with each А. 

The variance of 4, is defined to be 


р— 1 

Under the assumption that the terms on the right-hand side of (3) are 
Statistically independent, the expected value of the variance of the left-hand 
side of (3) will be equal to the sum of the variances of the terms on the right- 
hand side. Terms which are constants have zero variance. Thus, 

(4) E(s%) = 02 + 05 + %- 

The mean square due to the main effect of factor A may be written as 


MS, = 451. 
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Hence (4) becomes 
(5) E(MS,) = E(ngs) = ngE(s‘;) = ngo? + ngo= + ngo; 

The analogue of the theorem stated in (1) may now be applied to each of 
the variances in (5) which have a mean asa subscript. Thus, £isthe mean of 
the experimental error associated with the ng observations from which A; 
is computed. Therefore, 

2 па(1 — n|N)o; _ ( "Jas 
М. 


nqo, = E тА 


(The experimental error іп the basic linear model is associated with each 
element in the basic population from which the elements are drawn. Hence 
the sampling fraction associated with the experimental error is n/N.) 

The mean interaction effect in (4) represents the average over the q values 
of «В, present at levela; Hence ap, may be considered to be the mean ofa 
sample of size q levels from a population of size 0 levels of factor В. The 
analogue of the theorem in (1) now takes the form 

пдо?. = nq(1 — q/Q)os; 2 »( ja. 
ap q 
Since the main effects and interaction effects are assumed to be independent, 
restricting the sample of q values of аф to level a; does not invalidate the 
theorem summarized in (1). 

The expression for the expected value of the mean square of the main 

effect of factor 4 may now be written as 


(9 E(MS,) = [ m п) + »(i - НЕ + ngog. 


Each of the variances in (6) is associated with the parameters in the general 
linear model, whereas in (5) some of the variances were in terms of param- 
eters which are not explicitly in the general linear model. The purpose of 
defining MS, as nqs' now becomes clear. By adding the multiplier ng, one 
may conveniently express each of the variances in (5) in terms of parameters 
which are explicit in the linear model. 

Derivation of the expected values of the mean squares for the main effects 
of B and for the AB interaction follows the same general line of reasoning. 
The algebraic procedures whereby the various means are obtained from the 
experimental data are carried through in terms of the right-hand side of the 
basic structural model for an Observation. Then, using the assumptions 
underlying the model, one obtains the expected values of the mean squares 
by the principles that have just been outlined. 

The principles underlying the derivation of expected values of the mean 
squares will be illustrated forthecaseofa2 x 2 factorial experiment having 
n observations in each cell. In terms of a general linear model, the cell 
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means and the marginal means for the levels of factor B may be represented 
as follows: 
b b, 
ABy = н. ға) + ofa + ёп АВ» = н. + % + By + “бы Жез 
АВ, = u., + % + By + «Ви + ên АВ» = u., + % + Ba + авы + ёз 
В, =u +8 +B + a B, =p +% +В + Ва + En 
The difference between the two marginal means for factor B estimates the 
following parameters: 
B, — By = (By — В) + (@«Ву— x83) — (E1 — £3). 
Multiplying each side of the above expression by np gives 
ng(B, — B) = np(f — Ba) + про — #B.2) + np(£a — ёз). 
Since the terms on the right-hand side of the above expression are assumed 
to be statistically independent, the variance of the term on the left-hand side 
will estimate the sum of the variances of the terms on the right-hand side. 
Hence, 


E(MS,) = про? + npo; + пров 
Using the analogue of the relation given in (1), 


пра, = n( - Да, and пра = (1-1) 
Thus, E(MS,) = npo; + (i - 2), + (: - п) 


In most experimental designs the potential number of experimental units, 
N, is considered to be infinite. Hence 1 — n/N = 1. ь 

In a р xq factorial experiment having л observations per cell, least- 
squares estimates of the terms in the general linear model are obtained in a 
relatively simple manner, provided that all factors are fixed. Let m, а,, b; 
and ab, be estimates of the parameters и, о, Bi, and aß; The least-squares 
estimates are obtained from the condition 


(7) LID Kise — a — b; — abi; — my — minimum, 
iik 


subject to the restrictions that 
Уа,-0,  ZXb,—0, 54%; =0, Zab, = 


Corresponding restrictions on the parameters follow directly from the 
definitions under model I. 


The solutions to the normal equations obtained from (7) are 


т = б, 
a, = А — 6, 
b, = B, —6, 


ab, = AB, — A, — B; + 6. 
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These estimators are identical to those obtained by working directly from the 
linear model. When the cell frequencies are not equal, the least-squares 
estimators do not have this simple form. The latter case is considered in 
Sec. 5.25. 


5.7 Principles for Constructing F Ratios 


For either model I (all factors assumed fixed) or model III (all factors 
assumed random) the sampling distributions of the mean squares for main 
effects and interactions can be adequately approximated by independent 
chi-square distributions. For model II (some factors fixed, some factors 
random) all required sampling distributions are independent chi-square 
distributions only if highly restrictive assumptions on covariances are made. 
The principles for testing hypotheses to be presented in this section hold 
rigorously for models I and Ш. In practice, tests under model II follow 
the principles to be presented here; however, interpretations under model II 
require special care. Scheffé (1959, pp. 264, 288), in particular, has ques- 
tioned principles for constructing F tests of the type to be presented here 
for special cases of model II. 

The hypothesis that x, = a, == %p (that is, the hypothesis of no 
differences between the main effects of factor A) is equivalent to the hy- 
pothesis that ož = 0. This hypothesis is in turn equivalent to the following 
hypotheses: 

1. All possible comparisons (or contrasts) among the main effects of 
factor A are equal to zero. 

2. а. = fe, = "+ = p, = ц. 

To test this hypothesis against the alternative hypothesis that o2 > 0 
requires the construction of an Fratio. In terms of the expected value of 
mean squares, the F ratio for this test has the general form 

Ер) = 00, 
и 
Where и is some linear function of the variances of other parameters in the 
model and c is some coefficient. In words, E(MSyumerator) must be equal to 
E(MSgenominator) when c? = 0. 

For the test under consideration, the mean square in the numerator of the 
Fratio must be MS,. The mean square that is in the denominator depends 
upon the expected value of MS, under the proper model. For model I, 
the appropriate denominator for this F ratio is Мб; for model III, the 
appropriate denominator is MS,,. Thus, in order to form an F ratio in the 
analysis of variance, knowledge of expected value of mean squares under 
the appropriate model is needed. This, in essence, implies that the F ratio 
depends upon the design of the experiment, 
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If the numerator and denominator of an F ratio satisfy the structural 
requirements in terms of the expected values of the mean squares, and if the 
sampling distributions of these mean squares are independent chi squares 
when the hypothesis being tested is true, then the resulting F ratio will have 
a sampling distribution approximated by an Fdistribution. The degrees of 
freedom for the resulting F distribution are, respectively, the degrees of free- 
dom for the numerator mean square and the degrees of freedom for the 
denominator mean square. General principles for setting up F ratios are 
illustrated in Table 5.7-1. 


Table 5.7-1 Tests of Hypotheses under Model III 


Hypothesis 


Source of variation E(MS) being tested F ratio 


Main effect of factor A | o? + no2, + ngoz | Hy: о = 0 F = MS/MS 
Main effect of factor B | б? + naz, + праў | Hy: о =0 Е = MS,MS,, 
A x B interaction оў + nois Hy: 63, =0 F = MS,o/MSerror 


Error оз 
о cM А о ee eee 


The expected values in this table are those appropriate for model Ш. In 
terms of these expected values the F ratio used to test the hypothesis that 


o = 0 has the form 
E(F) = о? + лов Е ngo? i 
о; + по 

When the hypothesis being tested is true (when оң = 0), numerator and 
denominator of the expected value of the F ratio are identical. Thus, 
E(MS,) = E(MS,) when o2 —0. When zis greater than zero, the expected 
value of the F ratio is greater than unity by an amount which depends upon 
the term nqo2. From the structure of the F ratio, it can be less than unity 
only because of sampling error associated with MS, and М5л; for any single 
experiment each of these statistics may be independently either less than its 
expected value or greater than its expected value. Alternatively, the F 
ratio might be less than unity when some of the assumptions about the model 
do not hold. A e 

The F ratio appropriate for testing the hypothesis o7, = 0 has the struc- 
ture 

2 по? 
Е(Е) = ло А 
O; 

When the hypothesis being tested is true, numerator and denominator have 
the same expected value. When ož; > 0, the expected value of this F ratio 
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will be greater than unity. The F ratio as obtained from the experimental 
data has the form 
_ MS» 
MSerror 


Under the hypothesis that o2; = 0, the expected value of this F ratio is unity, 
When оз, is greater than zero, this F statistic has an expected value greater 
than unity by an amount which is a function of no2,. Under the hypothesis 
that оз; = 0, the sampling distribution of the F ratio is the F distribution 
having (p — 1)(q — 1) and pq(n — 1) degrees of freedom. 

To evaluate the power of a test using an F ratio requires a knowledge of 
the sampling distribution of the F ratio for specified nonzero values of the 
variance in the hypothesis beingtested. For nonzero values of this variance, 
the sampling distribution of the F ratio is not the ordinary F distribution but 
rather the noncentral F distribution. Convenient charts are available 
(Table B.11) to determine the sample size required for a predetermined 
power with respect to specified alternative hypotheses. A collection of 
charts for determining the power of F tests also appears in Scheffé (1959, 
pp. 438-455). 


5.8 Higher-order Factorial Experiments 


When a factorial experiment includes three or more factors, different 
orders of interaction are possible. For example, ina 2 x 3 x 5 factorial 
experiment, having 10 independent observations in each cell, the analysis of 
variance generally has the form given in Table 5.8-1. 


Table 5.8-1 Analysis of Variance for 2 x 3 x 5 Factorial Experiment 
Having 10 Observations per Cell 


Ss a Зе ee 


Source of variation Sum of squares df df (general) 
A main effects SS, 1 pei 
B main effects SS, 2 4-1 
С main effects SS, 4 зд 
АВ interaction SS.» 2 (p= Dg — 1) 
AC interaction з 4 (р — 10" — 1) 
BC interaction SS,, 8 (4—1) — 1) 
ABC interaction au 8 (р — 104 — 1I)tr = 1) 
Experimental error 
(within cell) SSerror 270 par(n — 1) 
Total SStotai 299 тра” — 1 


ie a fete жалы ы, ыы 


Ina three-factor experiment there are three interactions which involve two 
factors: А х B, A x С, Bx C. There is one three-factor interaction. 
The A x B x Cinteraction represents the unique effects attributable to the 
combination of the three factors, i.e., the effects that cannot be predicted 
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from a knowledge of the main effects and all two-factor interactions. The 
notation that was introduced for the case of a two-factor experiment can be 
extended as follows: 


Levels in population Levels in experiment 
Factor A 4,,4%..., 4һ...,ар CUP YS Ais +. йу 
Factor B by, by... bys- ++, bg лбу, Сал ыд, Бу 
Factor С Cis: Ĉio СК Се. босы дез ыы» vi Cp 


The definitions of fixed and random factor given in Sec. 5.2 also apply to 
factor С. Ifr = R, then factor C is a fixed factor. If the r levels of factor 
Cin the experiment are a random sample from the R levelsin the population, 
and if R is quite large relative to r, then factor C is a random factor. If 
the В levels are reduced to Ку levels by some systematic, nonrandom 
procedure, then factor С is considered fixed when r = Retrective- 

The notation for cell means used for the case of a two-factor experiment 
may also be extended. An observation on element m under treatment 
combination abc, is designated by X;,,,. Notation for cellmeansissum- 
marized in Table 5.8-2. Іп this notation system, и, designates the mean 


Table 5.8-2 Notation for Means in a Three-factor Experiment 
Lorie e m 


Population mean | Experiment mean 

Elements in cell abc;;;. Бик АВС, 
Elements under ab;; hij, АВ, 
Elements under ас; Ш.к АС» 
Elements under Бс; Hk ВС 
Elements under a; Fi.. 4 

Elements under 5; Hy, В, 

Elements under су. А.к б 


А 


of the N potential observations that could be made under treatment com- 
bination abc, Тһе notation и, designates the mean of the NR poten- 
tial observations that could be made under the treatment combinations 
абс, абс» . . . , айсың (N potential observations under each of the R 


treatment combinations). In terms of symbols, 


ик 
Hi. = ne 


(The subscript K is used here to indicate that the average is over all potential 
levels of factor C and not just those in any single experiment.) 
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The notation и, designates the mean of the МОК potential observations 
that could be made under the treatment combinations in which factor А is 
at level а). Thus, 


£Xugk 
cem T. 
Qu 


(The subscripts J and K indicate that the average is over all levels of factors 
Band C, not just those included in the experiment.) Similarly, ., denotes 
the mean of the potential NPQ observations that could be made under level 


Cx, that is, 
Dern 
dub. f 


Же РО 


The notation К refers to a level of factor С actually included in an experiment. 
If all the factors in an experiment are fixed factors, there is no need to make 
the distinction between 7, J, and K and i, j, and К. 

The numerical data given in Table 5.8-3 will be used to illustrate the 
definitions of main effects and interactions in a three-factor experiment. 
The data in this table include all levels of each of the factors, and the entries 
in the cells are the means of all the potential observations that could be in 
each of the cells, Thus the numerical entries represent the parameters for a 
specified population. 

In part i the cell entries may be designated by the symbol шуух. For 
example, uj; = 60. Іп symbols, 


УХ 211. 
M 


Hai = NE = 60. 


Hi.. 


The entries along the lower margin of part i represent the means of the re- 
spective columns. Thus the entry 40 at the extreme left represents the mean 
of all potential observations in which factor B is at level b, and factor C is at 
level сү; in symbols, this mean is Has Thus, 


Zun _ 20 + 60 
P 2 


Mu = = 40. 


Part ia summarizes the symbols for the entries in part i. 
In part ii each of the numerical entries in the cells represents a mean which 
has the general symbol иу x. For example, the entry in cell асуу is 


Ё Хел 60 + 40 + 50 50 
2.1 Q 3 . 
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Each of the marginal entries to the right of the cells in part ii has the general 
symbol иу. For example, the entry 30 is the mean of the entries in row 
ау; in symbols, иу = 30. Thus, 


Жек 49 420 
R 2 


Ma.. = 30. 


The entries along the bottom margin of part ii represent the means of all 
potential observations at specified levels of factor C. Thus, 


Part iia summarizes the symbols for corresponding entries іп part ii. 
In part iii each of the cell entries has the general designation u;;. The 
marginal entries at the right may be designated иу; the marginal entries 


Table 5.8-3 Population Means for 2 x 3 x 2 Factorial Experiment 
b, by by 


(i) a, | 20 0 
а; 60 40 40 20 50 30 


сі c, Mean b, by b, Mean 
(ii) x 40 20 30 (iii) а 10 20 60 30 
аз | 50 30 40 а; 50 30 40 40 г 
45 25 35 30 25 50 35 
ы % bs 
сі сә сі са сі c, Mean 
(ia) a | шап #n | Azn az | "usi Pa M.. 


az | Hg Mere | Мәл Far | Fasi Kaz Po.. 


дл Has | Ka Біз | Ka Kse 


с c, Mean b, by b, Mean 


Gia) a| ша дз Ha. 
Ga | Mai. Paa Mas. 


(iiia) 4j | Mu. Ға. Has, Мі. 
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at the bottom may be designated и z. Thus the entry 50 at the bottom of 
part iii is 
Хав. 
АИ ROO 
D 2 


Из. 


The main effects and interaction effects for a three-factor experiment will 
be defined in terms of the data in Table 5.8-3. Тһе main effect due to level 
a, is 


a = My, — и. = 30 — 35 = —5. 


In words, the main effect of level a, is a measure of the extent to which the 
mean of all potential observations at level а), averaged over all potential 
levels of factors B and C, differs from the grand mean of all potential obser- 
vations. In general the main effect of level a; is 


= uo Bo 
The main effect of level b, of factor B is by definition 


Bs = из. — u... = 50 — 35 = 15. 
In general the main effect of level 5; is 


B, — us, — n... 
The main effect for level с, of factor C has the general definition 
PM e Hs 


The interaction effect of level a; with level b, is the definition 
OB is = s, — B, — о — By = nis, — (и... + а, 4 Bj) 
= pu. — Mi. — Шу, + ш... 


In words, op; measures the extent to which the mean of all potential obser- 
vations differs from the sum of the main effects of a, and b ; and the grand 
mean. For example, 


Bis = 60 — 30 — 50 + 35 = 15. 
The interaction effect of level a, with c, is 


yi = Mie — Ш... — 9 — уь 
= шь — Ш. — Bar B. 
The interaction effect of level b; with c, is 
BY n= n — n. — B;— yx 
= Bu — Ba. — Bax Bu 
For example, the interaction of level Б» with level c, is 
Бул = 60 — 50 — 45 + 35 = 0. 
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The interaction of a;, Б,, and c, is defined to be 
ару = Mise — Ш... — (Buy + у + буу + ai + В; + у) 
= Hiie — Шу. — Pi — Paste БШ. Шз. Ba - М... 
In words, the interaction of three factors measures the difference between 
the mean of all potential observations under a specified combination, иу, 
and the sum of two-factor interactions, main effects, and the grand mean. 
The three-factor interaction is in essence a measure of the nonadditivity of 
two-factor interactions and main effects. For example, for the data in 
Table 5.8-3, the interaction of ау, bs, and c, is 


аруз = asa — Шз. — №2 T Haa F La. 
50 60 —20 —40 +30 
==) 

Equivalently, ^ ay; = pss — K... — (fs + ©з + зв + % + Bs + уз) 
= 50 — 35 — [15 + 0 +0 + (—5) + 15 + (—10)] 
==“ 

From the definitions of these effects it follows that 
Zar =0, Zh =0, Хук = 0; 
Хав) Хав Уук Levi« УВузк УВуук = 0; 
Т m T K J K 
Xsbrux Zobi Хулк 0. 


In each case the summation is over all the potential levels of the factors. 10, 
for example, k 5 К, then 
in #0. 


The variance due to the main effects of factor A is 

Уо] 

2 B 
P-—1 
Similarly, the variances due to the main effects of factors B and C are, 
respectively, 


Шар air... 
S091 25'— 35 


= 


Уб 
ides rt 
Хук 
а 21” 


When the main effects within any factor are all equal, the variance corre- 
sponding to these main effects will be zero. Hence the equality of main 
effects implies that the variance corresponding to these main effects is zero. 
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The variance due to two-factor interactions is defined as follows: 
Ха,» 
а-қ TI 
LU] WE RU 
(PESO 1) 
53307720 
IK 
бау ry 
(P — 0(К — 1) 
УУ(Вузк)? 
: Ln. JK 
Oby = to 
(Q — (К — 1) 
The variance due to the three-factor interaction is 
УУУ («бурук 
0%, = IJK А 
(pO — DOR 1) 
The variance due to the experimental error has the same general definition 
as that given for two-factor experiments; it is the variance of the measure- 


ments on the N potential elements within each potential cell of the experi- 
ment. Thus, for cell abc; 


È (Xim — Min 
2 M 
tk N = 1 
where the subscript M represents a potential element in the cell specified. 


Assuming that the variance due to experimental error is equal forall poten- 
tial cells in the experiment, the over-all variance due to experimental error is 


6, 


2 
n 
: PORS | 


Mean squares as computed from the actual experimental data are esti- 
mates of linear functions of these population variances. The mean Squares 
of main effects, as obtained from observed data, will estimate a linear func- 
tion of the variance due to the specified main effects, variance due to inter- 
actions effects, and variance due to experimental error. 

_ The extension of the notation and definitions to four-factor experiments 
is direct. For example, the main effect of level а, of factor D is 


O n = Hia — Hue 
This main effect is estimated by 
est (0,) = Б, – б. 
The notation est (ôm) denotes an estimate of the parameter 6,, obtained from 
the data in the experiment. The interaction effect associated with levels a; 
апа а, is 
абы = Him — B... — (0 + 4,,). 
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This interaction effect is estimated by 
est (x0,,) = ADim — 4; — D, + С. 
The interaction of levels a,, b;, and d,, is defined to be 
ад = Шут — М... — (Bis + об, + BO 5m) + (о + В, + Ôn). 
This interaction effect is estimated by 
est (z/0,,,) = ABD,;,, — АВ, — ADim — BD, + A, + B; + D, — С. 


This last estimate has the general form 
(3-factor mean) — X(2-factor means) + X(1-factor means) — С. 
The estimate of a four-factor interaction has the general form 
(4-factor mean) — X(3-factor means) + Х(2-Гасіог means) 
— X(1-factor means) 4- 6. 
For example, the interaction of levels 41, bs, cs, d, is estimated by 
est (2/5134) = ABC Dyog4 — (АВС + АВ”, + АСД + ВСР) 
+ (АВ, + АС + Арц + ВС» + ВВ, + CDu) 
— (A, + B, + C + D) +G. 
The term X(3-factor means) in the general expression for the estimate of a 
four-factor interaction effect includes all possible means of the form 


UVW, where ғ, s, and t are the subscripts for corresponding terms in the 
interaction effect being estimated. In a four-factor experiment the number 
of terms іп Х(3-Ғасіог means) is equal to the number of combinations of four 
things taken three at a time, which is symbolized ‚Су. This number is 
4342 
(oh nn =4. 
жы Ыр 
For а four-factor experimént the number of terms in the summation X(2- 
factor means) is 
4:3 
G= 75-5 
In a k-factor experiment, the estimate of the k-factor interaction effect 
has the general form 
(k-factor mean) — X[(k — 1)-factor means] + Z[(k — 2)-factor means | 
— X[(k — 3)-factor means] + Х[(К — 4)-factor means] 


The last term is +[(k — k)-factor means] = --G. Ifkisanevennumber, 
the last term is +G; if k is an odd number, the last term is —G. Thenumber 
of terms in the summation X[(k — 1)-factor means] is „С, _1; the number of 
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terms in the summation X[(k — 2)-factor means] is „С, . For example, 
(К <= 5, 


35243 5:4-3-2 
Cy. = 503 = ——— = 10; (OEC Gea iS, 
k*k-2 5-3 1.5.3 kVk-1 54 12-374 


5.9 Estimation and Tests of Significance for Three-factor Experiments 


For purposes of demonstrating the principles underlying the analysis that 
will be made, it is convenientto formulate a structural model for an observa- 
tion. For a three-factor experiment, the structural model has the form 


(1) Xim = (ADC ix) + Eisem 


The observation on element m under treatment combination абс is 
designated by the symbol X,,,. The symbol / (арс) denotes the hypo- 
thetically true effect of the treatment combination abc, as measured by 
population main effects and population interaction effects. The symbol 
ёп 18 the experimental error associated with the measurement on element 
т. In this context the experimental error is considered to be the difference 
between the observed measurement and the “true” measurement given by 
абс а) 

For purposes of the analysis that follows it will be assumed that the true 
measurement is a linear function of the main effects and interaction effects. 
Specifically, 


(2) Slabe) = p... + а + By + yy + nf + жуу, + буу, + «бу. 


Thus, (2) expresses the true measurement аз а linear function of main effects 
and interaction effects associated with a specific treatment combination. 
If (1) and (2) are combined, the resulting structural model is a generalization 
of the structural model given in Sec. 5.6 for a two-factor experiment, The 
assumptions that will be made about this model in the course of the analysis 
are direct generalizations of those summarized in Sec. 5.6. 

It will be assumed that p levels of factor A are selected at random from a 
population of P levels. It will further be assumed that an independent 
random sample of q levels of factor B is selected from a population of Q 
levels and that a third independent random sample of size r is selected from 
a population of R levels of factor C. The treatments in the experiment are 
the рд” combinations that result when each of the selected levels of one factor 
is combined with each of the selected levels from the other factors. For 
example, if level a, is used in combination with b, and сз, the resulting 
treatment is designated абс. In a p x д х r factorial experiment there 
аге pqr treatment combinations. 

It will also be assumed that a random sample of npgr elements is drawn 
from a specified population. Random subsamples of size л each are 
assigned to each of the pgr treatment combinations to be studied in the 
experiment. After administration of the treatments, each of the elements 
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is measured on a criterion of effectiveness (the dependent variable). The 
scale of measurement for the criterion is assumed to be given in terms of an 
experimentally meaningful unit. 

From the data obtained in the experiment, mean squares are computed to 
estimate variances due to the structural variables on the right-hand side of 
the structural model. For a three-factor experiment, the definitions of the 
mean squares for main effects, interactions, and experimental error аге sum- 
marized in Table 5.9-1. With theexception of multipliers and the ranges of 


Table 5.9-1 Definition of Mean Squares 


A main effect MS, -ттХА, - Суур — 1) 
B main effect MS,  -nprX(B, - С) — 1) 
C main effect MS, = mpg&(C, — G(r — 1) 
AB interaction MS, = "XB, — A, = B, + Gy(p — D4 — 1) 
AC interaction MS, = ng EAC = A, = €, + Суур ~ Yr — 1) 
BC interaction MS,, = прУў(ЙС,, - B, — €, + Gg — Dr — 1) 
ABC interaction М5 = nEXX(ABC;, — АВ, — АС, - ВС, 

+ Ay + В, + Cy = Суур —1)(4 = Wr = 1) 
Experimental error Мб, = EEEE Xim - ABC lpgr — 1) 
Mew ouo 102. luu e асе ee £o — 


summation, these definitions carry out on the experimental data the same 
operations that would be carried out on the population to obtain the 
variance due to main effects, interactions, and experimental error. 

For example, in the population the variance due to the main effects of 


factor A is 


Xu. — и.) 
"Ber! 


The mean square due to the main effects of factor A as computed from the 
data in the experiment is 
nqr (4, — Gy 
— í 5 


MS, = 


p-1 


As another example, in the population the variance due to the BC inter- 


action is 4 
DY — na. — и.к +.) 


na (Q — 1(R — 1) 
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The mean square due to the BC interaction as obtained from the data in the 


experiment is 
прУХВС» — B, — C, + Gy 

(ат 1) 
In each case, with the exceptions noted, the mean square duplicates for the 
data obtained from the experiment the definitions for the variance due to 
main effects and interactions in the population. 

The expected values of the mean squares computed from the experimental 
data are not, however, equal to a simple multiple of the corresponding 
variance in the population. For example, 

E(MS,) # пдғоз. 
In general, the expected value of the mean square due to the main effects of 
factor A will depend upon variance due to interactions with the levels of 
factor A, variance due to experimentalerror, and the variance due to the main 
effects of factor A. Similarly, 

E(MS,,) 5 про). 
In general the expected value of MS,, will depend upon the variance due to 
higher-order interactions which involve factors B and C, variance due to 
experimental error, and the variance due to the BC interaction, The 
expected values for the mean squares defined in Table 5.9-1 are summarized 
in Table 5,9-2. Part i of this table gives general values; part ii specializes 
these general values for specific cases. Case 2 is one of several possible 
special cases of the mixed model; for example, two factors may be random 
and one factor fixed; or one factor may be random, and two factorsmay be 
fixed. In all these cases it is assumed that the sample of elements observed 
under any treatment combination is a random sample of size n from a 
potentially infinite population of elements. 

Tests of significance in a three-factor experiment follow the same general 
rules as those for a two-factor experiment. Appropriate F ratios are deter- 
mined from the structure of the expected values of the mean squares which 
correspond to the design of the experiment. For the case of model I, all 
F ratios һауе MScrror for а denominator. For the case of model II given 
in Table 5,9-2, F ratios for main effects have the following structure: 


MS, . 
Hy: of=0, = ый, 


М5, 


Н: 0=0, =—; 


H: o=0; к” 


Tests on main effects for model Ш are considered in Sec. 5.15. 
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Under model П, the appropriate F ratio to test the hypothesis оз, = 0 is 


Eo Me 
MS, 


However, the appropriate F ratio for a test on the hypothesis that o7, = 0 
has the form 
— „М5, 
MSerror 


5.10 Simple Effects and Their Tests 


The definitions of simple effects will be cast in terms of a three-factor 
factorial experiment. However, the generalization to higher- or lower- 
order factorial experiments isdirect. Simple effects are associated with both 
main effects and interaction effects. The former are called simple main 
effects, the latter simple interaction effects. 

A p X q x r factorial experiment may be considered from different points 
of view. For some purposes, it can be considered as a set of gr single-factor 
experiments on factor A, each experiment being for a different combination 
of b; and с,. For other purposes, it can be considered as a set of r experi- 
ments, each experiment being a р х 4 factorial experiment for a different 
level of factor C. It will be convenient, for the present, to adopt this latter 
point of view. The analysis of the experiment for level с, takes the following 
form: 


Source of variation df 
A (for cj) p-1 
B (for c) Pigs 


AB (for с,) (p—1(q—1) 


The main effects for this single experiment, relative to the entire set of 
experiments, are called the simple main effects for level c,. Тһе interaction 
in this experiment is called the simple AB interaction for level c;. The 
variation due to the simple main effects of factor A, SS, tor e is related to the 
variation of the over-all main effect of factor А and the over-all AC inter- 
action. Specifically, 


255, fore, — SS, 4r SS,.. 
k 


In words, the sum of the variation due to the simple main effects of A at the 
various levels of factor C is equal to the variation due to the over-all main 
effect of factor A and the over-all AC interaction. When the variation due 
to the AC interaction is zero, the sum of the variation for these simple effects 
is equal to the over-all main effects. Analogously, it can be shown that 


Обр tore, == 55, == 55, 
к 
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Table 5.10-1 Definition of Effects 


Effect Definition 
Over-all main effect of a; a; ep. 
Simple main effect of a; for с оу) = Hk — Ux 
Over-all main effect of b; B; == 0: 
Simple main effect of b; for cy Bi) = Шз — uk 
Over-all interaction effect of ab;; а; = Hy, —H. —%  — By 
Simple interaction effect of ab;; for cp OB sco.) = Hije — H.k — ы — Piter) 


In words, the sum of the variations of the simple two-factor interactions is 
equal to the variation of the over-all two-factor interaction plus the three- 
factor interaction. 

The definitions of the simple effects, in terms of population means, are 
given in Table 5.10-1. It will be noted that simple effects have the-same 
general form as over-all factorial effects; simple effects, however, are re- 
stricted to a single level of one or more of the factors. The degree to which 
over-all effects approximate simple effects depends upon magnitudes of 
interactions. In the absence of interaction, over-all effects will be equal to 
corresponding simple effects. 

The definition of simple effects will be illustrated by the numerical data 
given in Table 5.10-2. The data in this table represent population means. 
An entry in a cell of part i has the general symbol x; an entry in a cell of 
part ii has the general symbol м); an entry in a cell of part iii has the general 


Table 5.10-2 Population Means for 2 х 2 x 2 Factorial Design 


сі Co 


9 a | 60 20 40 | 80 40 . 60 


(i) (iii) (iv) 


b, b, Mean с c, Mean сі c, Mean 


а 70 30 50 a,| 40 60 5 b| 30 50 40 
аҙ 10 10 10 а; 05-520 10 | 10 30 20 


Меап| 40 20 30 Mean} 20 40 30 Mean| 20 40 30 
P ESS 
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symbol и. ItisassumedthatP = О = R= 2. Thesimple main effects 
for factor A at levels c, and с, are most readily obtained from part iii. It 
will be found that 
«у = 50 — 30 = 20, оз = 10 — 30 = —20; 
946) =40—20= 20, as) = 0 — 20 = —20; 
ы = 60 — 40 = 20, aate) = 20 — 40 = —20. 
In each case the simple main effect is equal to the corresponding over-all 
main effect. This finding indicates that the two-factor interaction effects 
ay, are allzero and hencethat oz, = 0. Theay,,’saremostreadily obtained 
from part iii. 
«yy = 40 — 50 — 20 + 30 = 0, 
«Ур = 60 — 50 — 40 + 30 = 0, 
aya = 0 — 10 — 20 + 30 = 0, 
«уш = 20 — 10 — 40 + 30 = 0. 
Conversely, when two-factor interaction effects are found to be zero, cor- 
responding simple main effects will be equal to over-all main effects. 


The simple main effects for factor A at levels b, and b, are most readily 
obtained from part ii. These over-all and simple main effects are 


«у = 50 — 30 = 20, ® = 10 — 30 = —20; 
ыу = 70— 40 = 30, оь) = 10 — 40 = —30; 
ы) 730 —20— 10, о, = 10 — 20 = —10. 
In this case, simple main effects are not equal to corresponding over-all main 
effects. This finding indicates that the two-factor interactions «f,, are not 
all zero and hence that o2,0. The «f,/s are obtained from part ii. 
«Ви = 70—50—40+30= 10, 
«В = 30 — 50 — 20 + 30 = —10, 
ард = 10 — 10 — 40 + 30 = —10, 
«В» -10-10-20--30-- 10. 
The fact that the о, are not all equal to zero implies that the simple main 
effects are not equal to the corresponding over-all main effects. 


р The over-all two-factor interaction effects аге related to corresponding 
simple effects. Specifically, 


Ш 


(1) i; = Фр) — X. 
For example, 


В Aita — % = 10 — 20 10, 
ара = dan, — 95 30 — (—20) + —10. 


Similarly, 


(2) «В = Bian — В; 
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These last relationships indicate that two-factor interaction effects will be 
zero when simple main effects are equal to over-all main effects and, con- 
versely, that when two-factor interaction effects are zero corresponding 
simple main effects will be equal to over-all main effects. 

The simple two-factor interaction effects «fi; are most readily obtained 


from parti. For example, 
ора) = 60 — 40 30 + 20 = 10 
= fon — Шал — Far + Has 
оре» = 80 - 60 - 50 +40= 10 
= Hyg — Hia — Ила E M 


Thus it is noted that 
«В = f = 10, k= 1,2. 


In general, the over-all three-factor interaction effect is related to simple 
two-factor interaction effects by means of the relation 
(3) OB Yin = Bic. — ой 
Thus, when the simple interaction effect is equal to the over-all interaction 
effect, the corresponding three-factor interaction effect will be zero. 
For the data in part i in Table 5.10-2, 

аралау = P 
Hence from relation (3) it follows that «буу, = 0. In general, for the data 
in part i it will be found that 

ар = «Виз for all К. 

From relation (3) it follows that all «fy;s are equal to zero. Hence 
оу = 0. Analogous to (3) are the relations 
(4) аруа = «Уу — Vins 
(5) аВуць = Ву зка — BY ie 
That relation (3) implies (4) and (5) is illustrated geometrically in the next 


section, 
Estimates of simple effects are in general obtained in the same manner as 


corresponding over-all effects. For example, 


est a, = A; — С, 
whereas est aq) = АС — C, 
Similarly, est бл) = ВС» — Сь 


est in) = АВ, — В, 
The over-all interaction effect is estimated by 
est af, = АВ, — А; — В, + С, 
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whereas a simple interaction effect is estimated by 
est «б = АВС) АС, — BCy, + Cy. 


Variation due to simple effects is given by the general formula 
[est (simple effect) }*. 


Computational formulas for such variation are given in Chap. 6. 

Tests on simple effects depend upon expected values of mean squares 
for such effects. In general, appropriate expected values of mean squares 
for simple effects may be obtained from the expected values appropriate 
for experiments of lower dimension than the original experiment, i.e., by 
considering the simple effects as over-all effects of experiments of lower 
dimension. F ratios are constructed from expected values thus obtained. 
However, data from the entire experiment may be used to estimate mean 
squares, where such data are relevant. 


5.11 Geometric Interpretation of Higher-order Interactions 


Various orders of interactions may be represented geometrically. In 
terms of such representation, interesting aspects of the meaning of inter- 
actions will become clearer. In particular, it will be noted how simple 


Table 5.11-1 Population Means 


сі [7] 

by by by b, bs bs 
б” da 920.) 301.7 70. 40 A TEMO M SO... 20 
a,| 60 40 50 50 2 | 5407 20" 730. | 30 
40 35 60 45 Bouts: 401. 25 

(ii) (iii) 

Bisi balonda b b bs 
АТО ao e0 30 «| 40 35 60 45 
а| 50 30 40 40 2050009 “1517407 5225 
Зу 799 0. 25 Some 25. 150), 35 


interactions are related to over-all interactions. It will also be noted how 
simple interactions are related to higher-order interactions. The numerical 
data in Table 5.11-1 will be used for this purpose. It will be assumed that 
these data represent population means. 

Data necessary to plot the profiles for the BC interactions for the two 
levels of factor A are given in part i. The left-hand side of Fig. 5.1 
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represents the profiles of means which аге in the a; row of part i. The 
dotted line represents the means of the form и; the solid line represents 
means of the form иу. Thus the two profiles on the left represent the BC 
means forlevela,. These two profiles have the same shape (i.e., are parallel) 
This finding implies that all simple interaction effects Ву ка) are zero. 


Sea Lc 
70 m BC for combined a's 
M BC fora, | BC for a; e 


b bz b, b bz 5 b, bz b; 
Figure 5.1 
In general, the simple two-factor interaction at level a, has the definition 
(1) BY nta) = Fase — bas. йк} Mi 


For the data in Table 5.11-1, 
Вупа) = 20 — 10 — 40 + 30 = 0, 
Byr2ta) = 30 — 20 — 40 + 30 = 0, 
Вузма) = 70 — 60 — 40 + 30 = 0. 
The observation that the profiles on the left are parallel has the algebraic 


counterpart 
шал — Maye = constant for all j's. 


Hence the variation arising from such differences will bezero. This source 
of variation is that due to the BC interaction at a. , 
It is noted that the profiles of the BC means for level a, are parallel. This 


finding implies that all Вужау S аге zero. Hence 
55е at a, ar SSocata, =0+0=0. 
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In words, the sum of variations due to the simple interaction effects for the 
levels of factor A is zero. Since 


SSe ata; = SS,, Р 585.» 


when XSS» at a; = 0, both SS,, and SS,,, must be equal to zero. 


70 
AB for c, AB for combined c's 
AB for c; 


60 


50 


| 
bes, 75 bye UNT ы cat oa 
Figure 5.2 


The profiles of the over-all BC means, obtained from part iii of Table 
5.11-1, are given at the right in Fig. 5.1. The two profiles here are also 
parallel. This finding implies that 


Ву = 0,  forallj'sand к. 


Hence SS, must be zero. This latter result was implied by the fact that all 
the simple interaction effects of the буу s are zero. The fact that the 
BC profiles for a, and a, are parallel actually implies that the BC profiles for 
the combined levels of factor A are also parallel. To summarize, when the 
simple interactions of two factors at various levels of a third factor are all 
zero, the corresponding two-factor and three-factor interactions will also 
be zero. 

It is, however, possible for two-factor interactions to be nonzero and yet 
have zero three-factor interaction. The profiles іп Figure 5.2 illustrate this 
case. These profiles represent the 4B means for the two levels of factor 
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C (Table 5.11-1). The profiles of AB for с, are not parallel; the profiles 48 
at c, are not parallel. Hence 


YSSunatey #0. 
k 


Although the profiles within each level of factor C are not parallel, the a, 
profile for су is parallel to the a, profile for the combined levels of factor С. 
Similarly, the a, profile for c; is parallel to the a, profile for the combined 
levels of factor C. Тһе a, profiles for each level of factor C are also parallel 


150 
140 


130 [- 


120 [- 


AB for combined c's 


110 AB for с, 
100 
90 


80 


to the a; profile for the combined data. This finding implies that 55, will 
bezero. The fact that the 4B profiles for the combined data are not parallel 
indicates that SS,, will not be zero. 

To summarize the implications of the profiles with respect to the three- 
factor interaction, the latter will be zero when (1) the profiles of the two- 
factor means are parallel within each level of the third factor or when (2) the 
pattern of profiles for the two-factor means is geometrically similar to the 
pattern for the combined levels. In order that patterns be geometrically 
similar, corresponding profiles must be parallel. — || 

A set of profiles in which the three-factor interaction 1s nonzero but the 
two-factor interaction is zero is given in Fig. 5.3. (These profiles are not 
based upon the data in Table 5.11.1.) The profiles within level c; are not 
parallel to each other, nor is the profile at a, parallel to the a; profile for the 
combined levels of factor C. Hence the three-factor interaction is nonzero. 
However, the AB profiles for the combined levels of factor C are parallel. 
Thus SS,, = 0. 
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The geometric relationships between the simple two-factor interactions, 
the three-factor interaction, and the over-all two-factor interaction are seen 
more clearly by drawing three-dimensional profiles. The following data 
will be used for illustrative purposes. 


а | 6 20 | 80 40 
ар \ 0-77 10:15-20-7— 20 


(These data are the same as those in part i of Table 5.10-2.) These data 
represent the population means for a 2 x 2 x 2 factorial experiment. A 


geometric representation of the patterns formed by these means is given in 
Fig. 54. The left-hand panel represents the four means in which factor 
Cis at level c}. This panel is denoted AB for c}. The right-hand panel 
represents the four means in which factor C is at level с». The line (60,20) 
in the left panel is parallel to the line (80,40) in the right panel. The line 
(0,0) in the left panel is parallel to the line (20,20) in the right panel. Тһе 
vertical lines in these panels are automatically parallel by the method of con- 
struction. Since corresponding sides of the left and right panels are parallel, 
the two panels are geometrically similar. 
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Geometric similarity of the left and right panels forces similarity of the 
front and back panels as well as similarity of the top and bottom panels. In 
analysis-of-variance terms, similarity of panels implies that the profiles of 
the simple two-factor interactions have the same patterns for all levels of a 
third factor. Thus the fact that the panel AB for с; is similar to the panel 
AB for c, implies that the simple AB interactions have the same pattern for 
c as they do бог с». This in turn implies that SS,,, = 0. 

The fact that $$ = 0 is implied by the similarity of any two opposite 
panels—in turn, similarity of one pair of opposite panels implies the simi- 
larity of all other pairs. This geometric fact illustrates why one cannot dis- 
tinguish between the interactions (AB) x C, (AC) x B, and (BC) x A. 
Similarity of the left and right panels actually implies that (АВ) x Cis zero. 
But similarity of the left and right panels forces similarity of the front and 


Figure 5.5 


back panels. The latter similarity implies that (AC) x B is Zero. Thus, 
when (АВ) x Cis zero, (АС) x B must also be zero; one implies the other. 
More emphatically, one is not distinguishable from the other. 

When the three-factor interaction is zero, inspection of individual panels 
will provide information about the two-factor interactions. In Fig. 5.4 
the line (60,20) is not parallel to the line (0,0). This implies that the 
variance due to the AB interaction is not zero. In the front panel, the line 
(20,40) is parallel to the line (0,20.) This implies that the variance due tothe 
AC interaction is zero. In the top panel, the line (60,80) is parallel to the 
line (20,40). This implies that 55,, = 0. _ When the three-factor inter- 
action is not zero, inspection of the individual panels does not provide 
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information with respect to the two-factor interactions. The individual 
panels in pair must be averaged in order to obtain information relevant to the 
over-all two-factor interaction. 

Figure 5.5 illustrates a case in which all main effects are zero, all two-factor 
interactions are zero, but the three-factor interaction is not zero. Since 
the left and right panels are not geometrically similar, the simple AB inter- 
actions at the two levels of factor С are not zero. Hence the variation due 
to the three-factor interaction is not zero. When corresponding points 
on the left and right panels are averaged, the top line of the average profileis 
(20,20) and the bottom lineis (0,0). Hence іп the over-all AB profile thetop 
and bottom line are parallel; thus the variation due to the over-all AB inter- 
action is zero. 


5.12 Nested Factors (Hierarchal Designs) 


Consider an experiment conducted to evaluate the relative effectiveness 
of two drugs with respect to some specified criterion. Suppose that the 
design calls for the administration of drug 1 to n patients from each of 
hospitals 1, 2, and 3; drug 2 is to be administered to n patients from each of 
hospitals 4, 5, and 6. This design can be represented schematically as 
follows: 

TE = = кч с=ш= чыт p eos 
Drug 1 Drug 2 


Hosp. 1 Hosp. 2 | Hosp.3 | Hosp. 4 Hosp. 5 | Hosp. 6 


n n n n n п 


The difference between the mean effect of drug 1 and the mean effect of 
drug 2 will be due in part to differences between the unique effects associated 
with hospitals 1, 2, and 3 and the unique effects associated with hospitals 4, 
5, and 6. 

The unique effects associated with hospitals 1, 2, and 3 are confined to 
drug 1 whereas the unique effects associated with hospitals 4, 5, and 6 are 
confined to drug 2. Effects which are restricted to a single level of a factor 
are said to be nested within that factor. In the experimental design being 
considered, the hospital effects are nested under the drug factor. Since a 
given hospital appears only under one of the two drugs, there is no way of 
evaluating the interaction effect between the hospital and the drug. Before 
such an interaction effect can be evaluated, each hospital must appear under 
both levels of the drug factor. 

Thus, іп a two-factor experiment having one factor nested under the other, 
the interaction effect cannot be evaluated. For the general case of a two- 
factor experiment in which factor B is nested under factor A, the structural 


model is = 
АВ; = u., + a; + Bio + E. 
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The notation у indicates that the effect of level 5; is nested under level а). 
Note that no interaction term of the form «у, appears in the model. 
Inferences made from this type of design assume implicitly that the variation 
associated with this latter interaction is either zero or negligible relative to 
the variation associated with the main effects. 

The analysis of variance for the design outlined at the beginning of this 
section takes the following form: 


df for 
d Bo general case 
A Drug 1 P 
В (w. ау) Hospitals within drug 1 2 21 
B (м. аз) Hospitals within drug 2 2 12-1 
Within hospital 6(n — 1) pan — 1) 


a iS 


The expected values of the mean squares in this analysis are as follows: 
сыза eee 


Source of variation df E(MS) 
A Pil o? + nD,65 + nqog 
B (pooled) «p — 1) о + по) 
Experimental error 

(within cell) pan —1) | о? 


NEED l 
The symbol D, is used to designate the expression | — g/Q. Numerically 
D, = 0 when 4 = О (that is, when factor B is fixed), and D, = | when 
q/Q = 0 (that is, when factor B is random). To test the hypothesis that 
oi = 0, 


Е- MS; : when factor B is random; 
MS, 

Е- MSS 4 when factor В is fixed. 
MSerror 


By way of contrast, in a two-factor factorial experiment each level of one 
of the factors is associated with each level of the second factor. If the 
design outlined at the beginning of this section were changed to a two-factor 
factorial experiment, the new design could be represented schematically as 
follows: 


Hosp. 2 | Hosp.3 | Hosp. 4 Hosp. 5 | Hosp. 6 


Hosp. 1 


n[2 n[2 


n[2 n[2 n[2 п/2 
"|2 n[2 


n[2 n[2 n|2 n[2 


This factorial experiment requires л subjects from each of the hospitals, 
but 7/2 of the subjects from each hospital are given drug 1, and n/2 of the 
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subjects are given drug 2. In many cases the two-factor factorial experi- 
ment is to be preferred to the two-factor design in which the hospital factor is 
nested under the drug factor—particularly in those cases in which an inter- 
action effect might be suspected. However, there are some instances in 
which the experimenter may be forced to use a design in which one factor is 
nested within another. 

As another illustration of nested effects, consider the following experi- 
mental design: 


Drug 1 Drug 2 


Hosp. 1 | Hosp. 2 | Hosp.3 | Нозр.4 | Hosp. 5 | Hosp. 6 


Category 1 n 
Category 2 n | n n n n 


This design calls for a sample of п patients in category 1 and л patients in 
category 2 (random samples) from each of the hospitals. Patients from 
hospitals 1, 2, and 3 receive drug 1; patients from hospitals 4, 5, and 6 
receive drug 2. In this design the hospital factor is nested under the drug 


Table 5.12-1 Analysis of Three-factor Experiment in Which Factor В is Nested 
under Factor A 
a er sor m НЦ 


Source of variation df df for general case 
A Drugs 1 p 
В (w. ау) Hospital w. drug 1 2 1. 
4 pq = 0) 
B(w. ay) Hospital w. drug 2 2 4/--:1 
C Categories 1 nel 
AC Drug x category 1 (p — 1iXr — 1) 
(Bw. A) x C Hospital x category 4 pq = 00 — 1) 
Error Within cell 12(n — 1) pqrin = 1) 
Total 11 + 12(2 — 1) npgr — 1 


—M— 1+ 


factor. Since some patients from each category receive drug 1 and some 
patients from each category receive drug 2, the category factor is not nested 
under the drug factor. Further, since patients from each of the categories 
are obtained from each hospital, the category factor is not nested under 
the hospital factor. 

The design that has just been sketched may be considered as a three-factor 
experiment, the factors being drugs (A), hospitals (B), and categories (C). 
In this case, factor B is nested under factor A, but all other relationships are 


NESTED FACTORS (HIERARCHAL DESIGNS) 187 


those of a factorial experiment. The model for this type of experiment has 
the following form: 


АВС, = B+ tit В + Ye + ®у Py gx ae 


No interaction in which the subscript i appears twice occurs in this model. 
That is, the interactions о), and «буу do not appear. Hence the 
utility of this type of design is limited to situations in which such interactions 
are either zero or negligible relative to other sources of variation of interest. 

The analysis of variance for this design takes the form given in Table 
5.12-1. In making tests by means of the F ratio, the appropriate ratios are 
determined from the expected values of the mean squares. The latter аге 


Table 5.12-2 Expected Values of Mean Squares, Factor B Nested 


under Factor A 
ee ee НЕБЕ СЕЕ ББ 


Source of variation df Expected value of mean square 
A p-4 o? + nD,D,9%, + nqD,o3, + nr Dao} + ngrok 
Ву. А pq = 1) o? + пр,о%,, + nro 
[o r= а + проз, + пдр оё, + npqo; 
AC (p — 100 —1) | oe + nDo, + nqos, 
(Bw. A)C pq = 00 — 1) |o? + поз, 
Within cell | pain — 1) о? 


Е ________—_—————— 


in part a function of whether the factors іп the experiment are fixed or 
random. General expressions for the expected values of mean square 
appropriate for this type of design are given in Table 5.12-2. In this table 
the symbol D, is used to designate 1 — p/P, D, is used to designate | — 410, 
and D, is used to designate 1 — r/R. Each of these D’s is either 0 or 1 
depending, respectively, on whether the corresponding factor is fixed or 


random. › 
In this last table, the source of variation B w. A (the main effects due to 


factor B, which is nested within factor A) is actually the sum of the following 
main effects: 


df 
В №. а gd 
Bw. a, Ме 
Sum = Bw. A pq = 1) 


Similarly, the interaction (B w. A) х Crepresentsa pooling of the following 
interactions: 


df 
(Bw. а)С (q = Dr — 0 
(Bw. a,)C (4 = Ir —1) 


Sum = (Bw. А)С pq = e = 1) 
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If this pooled interaction is used in the denominator of an F ratio, the varia- 
tions which are pooled must be homogeneous if the resulting F ratio is to 
have an F distribution when the hypothesis being tested is true. 

As another illustration of nested effects, consider an experiment in which 
the subjects have been classified as follows: 


es 
Company 1 Company 2 


Dept. 1 Dept. 2 Dept. 3 ; Dept. 4 


Job1 | Job2 | Job3 | Job4 | Jobs | lob6 | Job7 ЕСІ 


n n n n n | n n | n 


Suppose that n people from each job are included in an experiment in 
which attitude toward a retirement plan is being studied. This design may 
be considered as a three-factor experiment in which the department factor 
(B) is nested under the company factor (4). The job factor (C) is nested 
under both factors Band A. This type of design is referred to as a hierarchal 
design. In a three-factor hierarchal experiment, factor B is nested under 
factor A, and factor C is nested under both factors Band А. (The design in 
Table 5.12-1 is only partially hierarchal, since factor C was not nested under 
either factor A or factor B.) 
The model for a three-factor hierarchal experiment has the form 


АВС» = и... + o + By + Vrun + ge 


The notation ууу) indicates that factor C is nested under both factors А and 
B. It should be noted that no interaction terms appear explicitly in this 
model. The expected values of the mean squares for this design are sum- 
marized in Table5.12-3. The numerical values of the D's in these expected 
values depend upon the respective sampling fractions for the levels of the 
factors. 


Table 5.12-3 Expected Values of Mean Squares for Three-factor 
Hierarchal Experiment 


——— — — ———— au —— o — —Á— JL lie 


Source of variation df Expected value of mean square 
A p-1 оё + пр,а + nrD,o5 + ngro® 
Bw. А pq = 1) о + про? + пкоў 
C w. (A and В) par — 1) og + по? 
Experimental error | pgr(n — 1) o 

Total "mpgr-d 


———— es 


The expected values of the mean squares for a completely hierarchal 
design are readily obtained, once the expected value of the mean square for 
the factor within which all other factors are nested is determined. The 
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expected value for each succeeding factor is obtained from the one above by 
dropping the last term and making D in the next to the last term unity. 
The following design represents a partially hierarchal design: 


City 1 City 2 


School 1 | School 2 | School 3 | School 4 


Method 1 n n n n 
Method 2 n n n n 


Suppose that the purpose of this experiment is to evaluate the relative 
effectiveness of two different methods of teaching a specified course. Since 
both methods of training are given within each of the schools and within 
each of the cities, there is no nesting with respect to the methods factor. The 
school factor is, however, nested within the city factor. The expected values 
of the mean squares for this design have the form given in Table 5.12-4. This 


Table 5.12-4 Expected Values of Mean Squares for Three-factor 
Partially Hierarchal Experiment 


Source of variation df Expected value of mean square 

A Methods p-1 o? + nD,o2, + ДЕРІ 
+ паго? 

B Cities 4-1 o? + nD,D,o, + пе уаз з 
+ npD,o? + npro}, 

Cw. В Schools within cities qr —1) o? + nDo, + про? 

AB (р —104—1)| o3 + п,о?, + тоу 

A x (Cw. B) qp—1r—-)| 9+ nož, 

Within cell рфг(п — 1) оё 


i 00m E ТРАНША ДЫ ЕН te 
design enables the experimenter to eliminate systematic sources of variation 
associated with differences between cities and differences between schools 
within cities from the experimental error—at the cost, however, of reduced 


degrees of freedom for experimental error. у 
As still another example of a useful design involving a nested factor, con- 


sider an experiment with the following schematic representation: 


Method 2 


Method 1 
Person 1 | Person 2 | Person 3 | Person 4 | Person 5 | Person 6 
Period 1 
Period 2 | 
Period 3 
Period 4 | 
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In this design persons 1, 2, and 3 are observed under method of training 1; 
criterion measures are obtained at four different periods during the training 
process. Persons 4, 5, and 6 are observed at comparable periods under 
training method 2. This design may be considered as a three-factor experi- 
ment in which the person factor is nested under the methods factor. If 
methods is considered to be factor A, persons factor B, and periods factor 
C, the model for this design is 


АВС, = и. +9 + Ву + У + Om By stare + Eine 
For this design there is only one observation in each cell of the experiment. 
Hence there is no within-cell variation. The analysis of variance is identical 


in form with that given in Table 5.12-2 if the within-cell variation is deleted. 
The expected values of the mean squares are also identical to those given in 


Table 5.12-5  Three-factor Partially Hierarchal Design 
(Factors A and C fixed, B random) 


Source of variation df Expected value of mean square 
A Methods ред 0% + ғаз + qro5 
Bw.A People within methods Pq = 1) оў + rog 
С Periods "-і 9; + оў„ + рдо? 
AC Method x period (p — Dr S] of + 03, + 02, 
(B w. A)C pq — Xr — 1) o? + ай, 


Table 5.12-2, with n set equal to unity. Specializing this design to the case 
in which factors A (methods) and C (periods) are fixed and factor B (persons) 
is random, one obtains the expected values given in Table 5.12-5. Іп this 
table, the source of variation due to B w. A represents the pooled variation 
of people within methods. A homogeneity assumption is required for this 
pooling. The interaction term (B w. A)C also represents a pooling of 
different sources of variation. The homogeneity assumption required for 
pooling in this case is equivalent to the assumption that the correlation 
between periods be constant within each of the methods. The F tests for 
the analysis in Table 5.12-5 have the following form: 


Hus o? 0, F MSnmnetnoas : 
MSpeopie w.methods 
Hy: оў = 0, F= MSperiods ; 
MSs w. аус 
H3: 92, 0, F MSmetnoa х period 
MSs w. лус 


, Rather than considering this last experiment as a three-factor partially 
hierarchal experiment, there are other means for classifying this type of 
experiment. In particular, it can be considered as a two-factor one in 
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which there are repeated measurements on one of the factors. This latter 
type of classification will receive more extensive treatment in Chap. 7. 


5,13 Split-plot Designs 

The split-plot design has much in common with partially hierarchal 
designs. The term split-plot comes from agricultural experimentation in 
which a single level of one treatment is applied to a relatively large plot of 
ground (the whole plot) but all levels of a second treatment are applied to 
subplots within the whole plot. For example, consider the following 
design, in which the levels of factor A are applied to the whole plots and the 
levels of factor B are applied to the subplots: 


а, а а аз 
Plot 1 Plot 2 Plot 3 Plot 4 
һал, фан га. = 
by by | | by | | 8 
Е }; | ы | ШЕ 
| 
| b ba | | bs by 


In this design, differences between the levels of factor A cannot be esti- 
mated independently of differences between groups of blocks. That is, 


est(%, — ж) = (plot 1 + plot 3) — (plot 2 + plot 4) 
= А, = Ay, 


For this reason the variation due to the levels of factor A is part of the 
between-plot effects. However, comparisons among the levels of factor 
Bare part of the within-plot variation. From the information within each 
of the plots, estimates of the main effects due to factor B may be obtained. 
These estimates are free of variation due to whole plots. Іп the analysis of 
this type of design, sources which are part of the whole-plot variation are 
usually grouped separately from those which are part of the within-plot 
Variation, 
The model for this experiment is 
Хк = M.. + 04 F Ten F By + Abis H тид + ё 

The notation m, designates the effect of plot k within level а). (This nota- 
tion indicates that the plot effects are nested within the levels of factor А.) 
The notation туду designates residual subplot effects. For the special, but 
frequently occurring, case in which A and B are fixed factors and the plots 
are a random sample from a specified population of plots, the analysis of 
Variance assumes the form given in Table 5.13-1. In this analysis, each of 
the p levels of factor A is assigned at random to np plots. Within each 
plot, the levels of factor B are assigned at random to the subplots. The 
expected values of the mean squares in this table are actually identical in 
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form with those given in Table 5.12-5 if plots are assigned the role of factor 
B in this latter design. 

There is a distinction between the designs usually placed in the hierarchal 
category and designs in the split-plot category. In the hierarchal designs, 
generally (but not always) all except one of the factors are modes of classi- 
fying the experimental units rather than treatments which are administered 
to the units by the experimenter. Such modes of classification are set up 
primarily to eliminate, in part, differences among the experimental units 


Table 5.13-1 Expected Values of Mean Squares for Split-plot Design 
(A and B fixed, plots random) 


тк ретте т ағ Expected value of 
mean square 
Between plots np —1 
i fia o + qo + ngo® 
Plots w. a, 
nt pln — 1) оё + qos 
Plots w. a, 
Within plots np(q — 1) 
B 4-1 of + оў, + проф 
AB (р = Dq — 1) of + оў, + по? в 
В x plots w. a, 
e р = 0 –1)) 0 + o} 
B х plots w. a, 


from the experimental error. Interaction between the treatment and the 
classifications is generally considered to be negligible. The usual hierar- 
chal experiment may be regarded as a single-factor experiment with controls 
on the grouping of the experimental units. 

In contrast, the split-plot design has two treatment factors. Whole plots 
are the experimental units for one of the factors, whereas the subplots are the 
experimental units for the second factor. If this distinction is made, the 
final design considered in the last section should be placed in the split-plot 
rather than the partially hierarchal category. 

By inspection of the design outlined at the beginning of this section it is 
not obvious that the AB interaction is free of variation due to whole-plot 
effects. To demonstrate that such is the case, consider only the whole-plot 
effects in each of the following means: 


Mean — Whole-plot effects 
A, ™ + тә 

B 7, + ma + т; + та 
AB, m +m 


G ті + My + т; + т, 
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An estimate of the interaction effect associated with treatment combination 
ab, is 
est «fy, = AB, — A, — B, + С. 

The whole-plot effects associated with est «f, are 

(т + тз) — (Gm + тз) — (m + mg + лз + та) + (m + T + mg + m) = 0. 
Thus the whole-plot effects for this interaction sum to zero. In general, 
the whole-plot effects for est ай), will sum to zero for alli’s and j’s. Hence 
the variation due to whole-plot effects does not influence variation due to the 
AB interaction. 

It should be noted, however, that the variation due to the AB interaction 
is not free of variation, if any, associated with the B x plot interaction. 
This latter source of variation may be analyzed as follows: 


B x plot «(рғ = 1) 


АВ (р = Dr – 1) 
В х plots №. a, 


pq — Ir — 1) 
В x plots w. а„ 

This analysis of the B x plot interaction shows the AB interaction as part 
ofthis source of variation. This actually follows directly from the fact that 
the effects of factor A are confounded with groups of plots. { 

Several variations of the split-plot design are possible. Опе such varia- 
tion is a double-split or split-split-plot design. Іп this type of design each 
level of factor A is assigned at random to n whole plots. (A total of np 
whole plots is required for this design.) Each of the whole plots is divided 
into g subplots. Тће д levels of factor B are then assigned at random to the 
subplots within each whole plot, and each subplot is divided into r sub- 
subplots. Ther levels of factor Care then assigned at random to each of the 
sub-subplots, Part of a split-split-plot design is illustrated schematically 
below: 


а а; 
bey, | bez. 
ОООО | ............. 
Бе, | bes 

9-7 УСА le inte 
bey bes 
= = Wn 

dt jani] р 
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Thus the experimental unit for factor A is the whole plot; the experimental 
unit for factor B is the subplot; and the experimental unit for factor C 
is the sub-subplot. Since the sub-subplots are nested within the subplots 
and the latter are nested within the whole plots, factor C is nested under the 
subplots and factor B is nested under the whole plots. Factor A is partially 
confounded with groups of whole plots. 

The model for this type of design may be written 


, 
Xinn Oty + ту + Bs + OB is + Rn + Ye + ауа + Вуљ + «Ву 
" 
F тез) F Eisem 


The notation z;,;; designates the residual sub-subplot effect. (This latter 
may also be regarded as the pooled ут and буты, interaction effects.) 
The analysis of this type of experiment takes the form given in Table 5.13-2. 


Ht 


Table 5.13-2 Expected Values of Mean Squares for Split-split-plot Design 
(A, B, C fixed, plots random) 


Source of variation df piece value or m 
square 
Between whole plots пр —1 
A ped o? + qro? + пдго? 
Whole-plot residual pi — 1) оў + qro? 
Within subplots np(q — 1) 
B еі 02 + ro, + пргоў 
AB (р = 009 – 1) оў + тоў, + про? 
Subplot residual pn — 1g — 1) o? + ro?, 
Within sub-subplots прд(т — 1) 
с LI оў + o2, + npqo?, 
AC (p — Qr — 1) оў + o2, + ngog, 
BC (q — Dr — 1) а? + c2. + праў, 
АВС (р = Iq — Dr — 1) оў + о?, + поё, 
Sub-subplot residual pqin — Yr — 1) о? + c, 


П 


The expected values of the mean squares given in this table are for the special 
case in which factors A, B, and C are considered fixed and plots are con- 
sidered random. The numbers of levels of factor A, B, and C are, respec- 
tively, p, q, and ғ; each level of factor A is assigned to л whole plots. 

The expected values of the mean squares indicate the structure of appro- 
priate F ratios. It will be noted that the sub-subplot residual mean square 
is the appropriate denominator for the main effect and all interactions in- 
volving factor C. More extensive consideration of split-plot designs is 
given in connection with designs having repeated measures on the same 
people. These designs form the subject matter of Chap. 7. 
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The design summarized in Table 5.13-2 involves pgr cells. Within each 
cell there are n independent observations. Hence the within-cell variation 
has pqr(n — 1) degrees of freedom. Itis of interest to note that the sum 
of the degrees of freedom for the residuals is the degrees of freedom for the 
within-cell variation. 


Source df 
Whole-plot residual p(n — 1) 
Subplot residual p(n — 1)(4 — 1) 
Sub-subplot residual pan - Ayr — 1) 

Total рға — 1) 


5.14 Rules for Deriving the Expected Values of Mean Squares 


Given an experiment in which the underlying variables can be assumed to 
satisfy the conditions of the general linear model, the expected values of the 
mean squares computed from the experimental data can be obtained by 
means of a relatively simple set of rules. Although these rules lead to an 
end product which has been proved to be statistically correct when the 
assumptions underlying the general linear model are met, the rules them- 
selves provide little insight into the mathematical rationale underlying the 
end product. Theassumptions that underlie the general linear model under 
consideration have been stated in Sec. 5.6. The rules which will be out- 
lined in this section are those developed by Cornfield and Tukey (1956). А 
similar set of rules will also be found in Bennett and Franklin (1954). 

As the rules will be given, no distinction will be made between random and 
fixed factors. However, certain of the terms become either 0 or | depending 
upon whether an experimental variable corresponds to a fixed or random 
factor. For purposes of simplifying the expressions that will result, the 
following notation will be used: 


D, —1— p[P, i=1,...,p. 
D, = 1— 40, ј= 1, ‚@- 
D, —1— r[R, К-і1,...,һ 


If p = P, that is, if factor A is fixed, then D, is zero. On the other hand, if 
factor A is random, D, is unity. Similarly, the other D's are either 0 or 1 
depending upon whether the corresponding factor is fixed or random. Іп 
the application of these rules to designs of special interest, the appropriate 
evaluation of the D's should be used rather than the D’s themselves. Тһе 
general statement of the rules is followed by a series of examples. —— — 

Rule |. Write the appropriate model for the design, making explicit in 
the notation those effects which are nested. Р 

Кие 2. Construct a two-way table in which the terms in the model (except 
the grand mean) are the row headings and the subscripts appearing in the 
Model are the column headings. Тһе number of columns in this table will 
be equal to the number of different subscripts in the model. The number 
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of rows will be equal to the number of terms in the model which have sub- 
scripts. The row headings should include all subscripts associated with a 
given term in the model. 

Rule 3. To obtain the entries in column i, 

enter D, in those rows having headings containing an / which 
is not nested, 

enter unity in those rows having headings containing an i 
which is nested, 

enter p in those rows having headings which do not contain 
an i. 

Rule 4. To obtain entries in column /, 

enter D, in those rows having headings containing a j which 
is not nested, 

enter unity in those rows having headings containing a j 
which is nested, 

enter q in those rows having headings which do not contain 
a j. 

Rule 5. Entries in all other columns follow the general pattern outlined 
in rules Запа 4. For example, the possible entries in column k would be 
D,, unity, and r. 

Rule 6. The expected value of the mean square for the main effect of 
factor A is a weighted sum of the variances due to all effects which contain the 
subscripti. Ifa row heading contains a subscript i, then the weight for the 
variance due to this row effect is the product of all entries in this row, 
the entry in column i being omitted. (For nested effects, see rule 10.) 

Rule 7. The expected value of the mean square for the main effect of 
factor B is a weighted sum of the variances due to all effects which contain 
the subscript j. If a row heading contains a j, the weight for the variance 
due to this effect is the product of all entries in this row, the entry in column 
Ј being omitted. (See rule 10.) 

Rule 8. Тһе expected value of the mean square for the AB interaction is 
a weighted sum of the variances due to all effects which contain both the 
subscripts i and j. If a row heading contains both the subscripts / and j, 
then the weight for the variance corresponding to this effect is the product of 
all entries in this row, the entries in both columns i and j being omitted. 

Rule 9. In general, the expected value of a mean square for an effect 
which has the general representation XYZ,,,, is a weighted sum of the 
variances due to all effects in the model which contain all the subscripts u, 
v, and w (and possibly other subscripts). If a row heading does contain 
all three of the three subscripts и, v, and w, then the weight for the variance 
due to the corresponding row effects is the product of all entries in this row, 
the entries in columns и, v, and и being omitted. 

Rule 10. Ifan effect is nested, the expected value of its mean square is a 
weighted sum of variances corresponding to all effects containing the same 
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subscripts as the nested effect. For example, if the main effect of factor B 
appears as 0, in the model, then the relevant effects are those which con- 
tain both the subscripts ¿and j. Similarly, if the term буу, appears in the 
model, in considering the set of relevant variances, row headings must con- 
tain all three of the subscripts i, j, and k. 

The application of these rules will be illustrated by means ofap x q x r 
partially hierarchal factorial design having n observations per cell. In this 
design, it will be assumed that factor В is nested under factor 4. This type 
of design may be represented schematically as follows: 


a а 


bim bam bza bio Lm Dy(o) 


Cy п observations in each cell 


The notation b; indicates that factor B is nested under factor A. Тһе 
structural model for this design may be written as 


Хк =... + о + By + Ук F Ayi F Dy so F Emin 

In accordance with rule 1, the notation in the structural model makes 
explicit those effects which are nested. Thus, the notation fj indicates 
that the levels of factor В are nested under the levels of factor 4. The nota- 
tion emus indicates that the unique effects associated with an observation 
on element m in cell арс, are nested under all effects; i.e., the experimental 


error is nested under all factors. 
Table 5.14-1 Expected Value of Mean Squares for Partially Hierarchal 
Factorial Design 


|) оза БЪДИ C een 
Effect i j k m E(MS) 


o? + nD,D,65, + nqD,os, + nr D,o5 + nqra; 


5%) D, 4 ғ п 

Bin 1 DII n | o + nD,03, + nro} 

Ук Р 4 D, n |9 + про, + проз, + про? 
«у Ds ud D, n | а + nD,0%, + поз, 

Вк | 1d D; we n |а + поз, 

Ema | 1 1 1 1 а? 


_`—________—_——————— 


Тһе two-way table called for by rule 2 is given in Table 5.14-1. The row 
headings аге the terms in the model, the term g.. being omitted. The 
column headings are the different subscripts that appear in the model. 

The entries that appear in column iof this table were obtained in accord- 
ance with rule 3. Since rows ж, and уд contain a subscript iwhich is not in 
parentheses, the entry in column i for each of these rowsis D,. Since rows 
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Во, BYstayes AN єг) each contains the subscript i in parentheses, the entry 
in column i for each of these rows is unity. All other entries in column / are 
р, Since none of the remaining row headings contains the subscript i. 

The entries that appear in column / were obtained in accordance with 
rule 4. The entries that appear in column m follow from rule 5—with 
one exception. From rule 5 the entry in row Emt) would be D,. Since 
the experimental error will always be considered a random variable, 
D, = 1 — (n/N) will always be considered equal to unity. Thus the n 
observations that appear within a cell in this design are considered to be a 
random sample from a potentially infinite number of observations that 
could be made within a cell. 

In accordance with rule 6, the expected value of the mean square for the 
main effect of factor A is a weighted sum of the variances due to all row 
effects which contain the subscript i. Thus E(MS,) is a weighted sum of 
the following variances: 

ОУ ОД rh ің, 
The weight for o7, which is given by the product of all terms in row æ; 
the term in column i being omitted, is nqr. The weight for o5, which is the 
product of all terms in row уу, the term in column i being omitted, turns 
out to be nrD,. The weight for o2, is the product of all terms in row жу 
the term in column і being omitted; this weight is ngD,. The properly 
weighted sum for E(MS,) appears to the right of row о. Thus, 


E(MS,) = o? + nD,D,os, + nq D,o?, + nrD,o? + ngro?. 


In words, the expected value of the mean square for the main effect of factor 
A is a weighted sum of a set of variances; the variances included in this set 
and the weights for each are determined by application of rule 6. 

Since, in this design, factor B is nested under factor A, the main effect of 
factor B is denoted by the symbol 2). Іп this case the expected value of 
the mean square for the main effect of factor B is a weighted sum of the 
variances of terms containing both the subscripts Тапа]. These variances 
are 


0% 0j, and oj. 


The weight for оз is the product of all terms in row Bio, the terms in columns 
iand j being omitted; this weight is nr. The weight for o, is the product of 
all terms in row дусол, the terms in columns i апа j being omitted. This 
weightisnD,. The expected value for the mean square of the main effect of 
factor B (which is nested under factor A) is given at the right of the row у). 

The other terms under the heading E(MS) in Table 5.14-1 were obtained 
by application of rules9 and 10. For example, the variances that enter into 
the expected value of MS,, are 


05 о), and o 


ay" 


D 


QUASI F RATIOS 199 


These are the variances corresponding to those effects which contain the 
subscripts į and k. 

The expected values for the mean squares in Table 5.14-1 may be 
specialized to cover specific designs by evaluating the D’s. For example, if 
factors A and С are fixed and factor B is random, D, = 0, D, = 0, and 
Р, = 1. For this case, 

E(MS,) = o? + nro + nqro;. 
This expected value was obtained from the corresponding expected value in 
Table 5.14-1 by evaluating the D,, D,, and D,. By a similar procedure, 
E(MS,) = оў + поў. 
The expected values of mean squares for specialized designs may be ob- 
tained by evaluating the D’s prior to the application of the rules. For the 


Table 5.14-2 Expected Value of Mean Squares for Partially 
Hierarchal Factorial Design 
(Factors A and C fixed, factor B random) 


Effect | i jf ok m E(MS) 
[^ 0 q r n o? + пгоў + nqro; 
В) 1 1 r n | of + nro 

Ук Р 4 0 n | оў + под, + npo; 
ауа 0 4 0 n | а + no}, + пдо?, 
BY icine 1 1 0 n | о + no}, 

Ém(ijk) 1 1 1 1 оё 


special case under consideration, the expected values of the mean squares 
are derived in Table 5.14-2. The expected values of the mean squares 
shown on the right are identical to those obtained by specializing the E(MS) 
given at the right of Table 5.14-1. For special cases of interest to the experi- 
menter, the simplest method of deriving the expected values of the mean 
squares is that illustrated in Table 5.14-2. 


5.15 Quasi F Ratios 


In some cases the appropriate F ratio cannot be constructed by direct 
application of the rules based upon expected values of mean squares. | For 
example, consider the case of a p x q x r factorial experiment having л 
observations per cell. If all factors are random, the expected values of the 
Mean squares are those given in Table 5.15-1. (These expected values 
Correspond to model III in Table 5.9-2.) / ) А 

In practice preliminary tests on the model (which are discussed in Sec. 
5.16) are made on the higher-order interactions before proceeding with the 
tests on main effects. (In many cases which arise in practice, tests оп main 
effects may be relatively meaningless when interactions are significantly 
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different from zero.) Ifnone of the interaction terms may be dropped from 
the model, then no single mean square can serve as a denominator in a test 
on main effects due to factor 4. The proper denominator for a test in this 
case should have the expected value 

о? + no35, + ngoz, + пгозр. 
None of the individual mean squares in Table 5.15-1 has this expected value. 
However, by adding and subtracting certain of the mean squares one may 


Table 5.15-1 Expected Values of Mean Squares 


(Model ІП) 
Source of variation E(MS) 
A o? + по? + той, + пгоз + nro; 
B ot + no2,,, + проў„ + nra; + пргоў 
с в + позз, + про), + ngoz, + npqo; 
AB о + по? gy + пгоўӊ 
АС оў + nois, + тда, 
BC оў + по? з, + проў, 
ABC o? + по? в, 
Experimental error өз 


obtain a composite mean square which has the required expected value. 
One such composite, assuming each of the mean squares is independent of 
the others, may be constructed as follows: 


E(MS,,) = o; + пош, + qa, 
E(MS,;) = o; + пош, 
--Е(М5,,) = —o; = поз, 


П 


ИД 


E(MS,, + М8, — М5) = o? + no, + nqo$, + nro, 


A quasi F ratio, which has the proper structural requirements in terms of 
expected values of mean squares, for a test on the main effect of factor A is 


EC MS, 
MS,, + MSas — MS. 


The symbol F' is used for this ratio rather than the symbol F. Since the 
denominator is a composite of different sources of variation, the sampling 
distribution of the F' ratio is not the usual F distribution, although the latter 
distribution may be used as an approximation. The denominator of this F’ 
ratio calls for subtractinga mean square. This could lead to the possibility 
of obtaininga negative denominator. Accordingto the population model, 
it is not possible to have a negative denominator in terms of parameters. 
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However, in terms of the estimates of these parameters it is possible to have 
a negative denominator. 

The following quasi F ratio avoids the possibility of a negative denomi- 
nator and still satisfies the structural requirements for a test on the main 
effect of factor A: 

pr — М® М5, 
MSas + MS, 


In terms of expected values of the mean squares, this ratio has the form 


duel 20? + 2па5, + пдо?, + nro + ngroz 
20° 4- 2no3s, + ngo, + пғаф 
Under the hypothesis that o? = 0, E(F") = 1. Similarly, the following 


F" ratio has the structural requirements in terms of the expected values of 
mean squares for a test of the hypothesis that оў = 0: 


_ MS, + М5 


Е" = 
MS, + MS,. 


Although these F” ratios satisfy the structural requirements in terms of 
expected values, the sampling distributions of these F” ratios can only be 
roughly approximated by the usual F distributions, provided that special 
degrees of freedom are used for numerator and denominator. Suppose 
that the F” ratio has the following general form, 


мүл 
mira 
w+x 
where и, v, w, and x are appropriate mean squares. Let respective degrees 


of freedom for these mean squares be fu Fos fu, and fo Then the degrees of 
freedom for the numerator are approximated by the nearest integral value to 


(и +0). 
lfa) + IS) 
The degrees of freedom for the denominator are approximated by 
(м + xy ‹ 
Qf) + СА) 


This approximation to the F distribution and the associated degrees of 
freedom are those suggested by Satterthwaite (1946). 
The F' ratio has the following general form: 


Ғ- 4 5 
w+x—v 
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If the F distribution is used to approximate the sampling distribution of the 
F' statistic, the degrees of freedom for the denominator are 


(м--х- 0)? 
WIS) + CF) + OIS) 


5.16 Preliminary Tests оп the Model and Pooling Procedures 


In deriving the expected values for the mean squares, extensive use is made 
of a model which is assumed to be appropriate for an experiment. The 
model indicates the relevant sources of variability. A question might be 
raised about why certain terms appear in the model and why other possible 
terms are omitted. Ifin fact there is no interaction effect of a given kind in 
the populationof interest, why should such an interaction term be included in 
themodel? Including such an interaction term in the model can potentially 
affect the expected values of several mean squares. The latter in turn deter- 
mine the structure of F ratios. 

Decisions about what terms should appear in the model and what terms 
should be omitted are generally based upon experience in an experimental 
area and knowledge about what are reasonable expectations with respect to 
underlying sources of variation—in short, subject-matter information. 
All sources of variation not specifically included in the model are in reality 
classified as part of the experimental error. In most cases the latter 
variation is the residual variation after all controlled sources have been 
estimated. Previous experimentation in an area may indicate that no 
interaction between two factors is to be expected; hence in designing a 
new experiment in this area such an interaction term may be omitted from 
the model. However, any variation due to this interaction, if it exists, is 
automatically included as part of the experimental error or automatically 
confounds other estimates, depending upon the experimental design. 

Lacking knowledge about interaction effects from past experimentation, 
one might ask whether or not data obtained in a given factorial experiment 
could be used as а basis for revising an initial model. The specification of 
the parameters in the initial model could be considered incomplete or left 
open to more complete specification. Tests designed to revise or complete 
the specification of parameters to be included in the model are called pre- 
liminary tests on the model. Such tests are particularly appropriate when 
one is dealing with experiments in which interactions between fixed and 
random factors or interactions between random factors are potentially in 
the model. Such terms may turn out to be denominators for F ratios. If 
such terms have a relatively small number of degrees of freedom, corre- 
sponding F ratios will have very low power. Since the mean squares for 
interactions between two or more fixed factors can never form the de- 
nominator in an F ratio, preliminary tests on such interactions are not 
generally required. 
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The procedure of making preliminary tests on higher-order interactions 
before proceeding with tests of lower-order interactions and main effects 
may be regarded as a multistage decision rule. Depending upon the out- 
come of a sequence of tests, the parameters in the model become more 
completely specified; in turn the expected values for the mean squares are 
revised sequentially. 

If a preliminary test does not reject the hypothesis that the variance due 
to an interaction effect is zero, one proceeds as if this variance were actually 
zero and drops the corresponding term from the model. The expected 
values of the mean squares are then revised in accordance with the new 
model, and additional preliminary tests, if necessary, are made. Care must 


Table 5.16-1 Expected Values of Mean Squares for Model in (1) 
(A fixed, B and C random) 


Source of variation df E(MS) 
A Pak оў + по? д, + ngog, + пгоў + nqroz 
B ql 0° + проў„ + пргоў 
с r-1 o? + про? + npqos 
AB (p = Ig — 1) оў + no25, + nro 
AC (p = Jr —1) 92 + па? з, + поз, 
BC (q — 1r — 1) o? + проў,„ 
АВС (p = Dq = er —1) | o + noi, 
Within cell pan — 1) оз 


be taken in such tests to avoid type 2 error, i.e., accepting the hypothesis of 
zero interaction when it should be rejected. Type 2 error can be kept 
numerically small by making preliminary tests at a numerically high type 1 
error, that is, x = .20 or .30. і 

The sequence іп which preliminary tests on the model аге made will be 
illustrated for the case ofapxqxr factorial experiment in which factor 
A is considered fixed and factors B and C are considered random. The 
initial model for this experiment is the following: 


(D Xinm = + 6 + В; + yy + ofa + ®у + Вуљ + о Уа еті: 


The question of whether or not all the interactions between random 
factors and between fixed and random factors should be included in the 
model is left unanswered for the time being. Assuming the complete model, 
the expected values of the mean squares are those given in Table 5.16-1. 
Suppose that the experimental work is completed and the analysis-of- 
variance table prepared. Tests of hypotheses depend upon the model and 
associated expected values. The model as given in (1) is tentative and sub- 
ject to change. Associated with this model are the expected values given in 
Table 5.16-1. Inspection of this model indicates that the appropriate 
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denominator for tests on the hypotheses 07; = 0 and oł}, = 0 is М5, cu. 
Suppose both of these tests are made at the 25 per cent level of significance. 
Suppose that these tests do not reject the respective hypotheses and that a 
priori information indicates no good basis for expecting that such inter- 
actions exist. On these grounds обу, and буу are now dropped from the 
model. 

The revised model now has the form 


(2) Хк» = B 04 В, +H уь + abis + ау, + Em 


The term г now includes the interaction terms which were dropped from the 
modelin(1). The revised expected values of the mean squares are obtained 


Table 5.16-2 Expected Values of Mean Squares for Model in (2) 


Source of variation df E(MS) 
A p-1 9; + пдо?,, + nro25 + паго? 
B 4-1 a? + пргоҙ 
G r—1 оў + прдо? 
AB (p = Iq – 1) оў + nrois 
AC (р = Dr —1) о? + ngo® 
BC 4 ау 
АВС 
Within | residual | npgr — pq — pr + p | o? 
cell 


—————————— —————————— 


by dropping ozp, and оз, from the terms in Table 5.16-1. When thisis done, 
MS,, MS,,, and MS, en are all estimates of c?. These three mean 
squares may be pooled to provide a single estimate of о? as shown in Table 
5.16-2. The pooled estimate is 

55, + SS, + SSw. сеп 
(a — Ir — D + (p — Da — Dr — 1) + par(n — 1) 
The degrees of freedom for the denominator are the sum of the degrees of 
freedom for the sources of variation which are pooled. This sum is equal 
tompqr — pq — pr + p. 

Inspection of Table 5.16-2 indicates that the hypotheses o}, = 0 and 
02; = 0 may both be tested with MS,, as a denominator. Suppose that 
these tests are made at the 25 per cent level of significance. Suppose that 
the outcome of these tests does not reject the hypothesis that ozy = 0 but 


pud the hypothesis that o2; = 0 is rejected. The revised model now has the 
orm 


(3) Xing = И + 04 + Bs + Ye + Abis + Eim: 


The experimental error in (3) includes variation due to «y as well as the 
interaction terms included in model (2). The expected values associated 


MSres em 
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with (3) are given in Table 5.16-3. These expected values may be taken as 


those appropriate for final tests. 
If there is a priori evidence to indicate interaction between factors, pre- 
liminary tests on such interactions should in general be avoided and tests 


Table 5.16-3 Expected Values of Mean Squares for Model in (3) 


Source of variation | df EMS) 
A р 1 оў + по? з + ngroz 
В | 4-1 оў + пргоҙ 
C r-41 оў + прдо3 
AB (р-1)4- 1) 92 + тоў; 
Residual (pooled AC, 

BC, ABC, and 

within cell) npqr —pq-—r+1 | о 


should be made in accordance with the original formulation. When pre- 
liminary tests are made, only interactions in which random factors appear are 
considered in such tests. (Only interactions with random factors can po- 
tentially form the denominator of an F ratio.) If an adequate number of 
degrees of freedom (say, 20 or more) is available for the denominator of F 
ratios constructed in terms of the original model, preliminary tests should 
also beavoided. However, if the denominator of an F ratio constructed in 
accordance with the original model has relatively few degrees of freedom 
(say, less than 10), in the absence of a priori knowledge preliminary tests are 
in order. 

A numerical example will be used to illustrate the pooling procedures 
associated with Tables 5.16-1, 5.16-2, and 5.16-3. Part i of Table 5.164 
represents the analysis of variance for a 4 x 2 x 3 factorial experiment 
having six observations per cell. Assume factor A fixed and factors B and 
Crandom. Ifthe model in (1) is assumed, the expected values of the mean 
Squares given in Table 5.16-1 are appropriate. Preliminary tests on 025, 
and oj, are made in parti. These tests indicate that the hypotheses that 
955, = 0 and ož, — Ocannot be rejected at the 25 per cent level of significance. 
Hence the corresponding terms are dropped from the model in (1). 

The residual term in part ii is obtained as follows: 


Source | ss df MS 
BC 18.00 2 
ABC 72.00 6 
Within cell 24000 | 24 
Residual (ii) | 330.00 32 10.31 


Pama ct Pee MM NE eee 
The variation due to residual ii corresponds to variation due to experi- 
mental error in the model given in (2). The expected values for the mean 


206 


DESIGN AND ANALYSIS OF FACTORIAL EXPERIMENTS 


Table 5.16-4 Numerical Example of Pooling Procedures 


(п =2,p = 4,9 =2,r = 3) 


First-stage preliminary tests: Hy: 025, = 0; Hy: o}, = 0 


Source SS df MS Е. 
А 120.00 3 40.00 
B 60,00 1 60.00 
с 40.00 2 20.00 
АВ 96.00 8 32.00 
(i) АС 72.00 6 12.00 
ВС 18.00 2 9.00 0.90 Е (2,24) = 1.47 
АВС 72.00 6 12.00 1.20 Е (6,24) = 1.41 
Within 
cell 240.00 24 10.00 
Second-stage preliminary tests: Hy: o2; = 0; Hy: o3, = 0 
Source | ss | d | ms | F 
A 120.00 3 40.00 
B 60.00 1 60.00 
C 40.00 2 20.00 
(ii) AB 96.00 3 32.00 3.10 F (3,32) = 1.44 
AC 72.00 6 12.00 1.16 Е (6,32) = 1.39 
Residual 
(ii) 330.00 32 10.31 
Analysis for final tests: 
Source | Ss df MS FO 
A 120.00 3 40.00 1.25 F ẹ(3,3) = 9.28 
B 60.00 1 60.00 5.67 F (1,38) = 4.10 
(ау € 40.00 2 20.00 1.89 F (2,38) = 3.25 
AB 96.00 3 32.00 3.02 Е (3,38) = 2.85 
Residual 
(iii) 402.00 38 10.58 


Squares in part ii are those given in Table 5.16-2. Second-stage tests are 

made in accordance with the latter expected values. As a result of the tests 

in partii, the variation associated with the AC interaction is pooled with the 

experimental error, but the variation due to the A B interaction is not pooled. 
The residual term in part iii is obtained as follows: 


Source ss df MS 
AC 72.00 6 
Residual (ii) 330.00 32 
Residual (iii) 402.00 38 10.58 
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Statistical tests in part iii are based upon the expected values in Table 
5.16-3. The denominator for the F ratio in the test on the main effect of 
factor A is MS,,; all other F ratios have MS,,, as a denominator. As 
distinguished from parts і and ii, the tests in part iii are final tests rather than 
preliminary tests; hence the difference in the level of significance. 

The sampling distributions of the statistics used in tests made following 
preliminary tests are actually different from the sampling distributions 
associated with tests which are not preceded by preliminary tests. What is 
really required in the second stage of a sequential decision procedure is the 
sampling distribution of the statistic in question under the condition that 
specified decisions have been made in the first stage. Using sampling 
distributions which do not have such conditions attached generally intro- 
duces a slight bias into the testing procedure. Specified percentile points 
on the unconditional sampling distribution are probably slightly lower than 
corresponding points on the conditional sampling distribution. That is, 
а statistic which falls at the 95th percentile point when referred to the uncon- 
ditional distribution may fall at only the 92d percentile point when referred 
to the conditional distribution. Hence use of the unconditional distribu- 
tions for sequential tests probably gives tests which have a slight positive 
bias; i.e., the type 1 error is slightly larger than the specified level of sig- 
nificance. 

By way of summary, it should be noted that there is no widespread agree- 
ment among statisticians on the wisdom of the pooling procedures which 
have been discussed in this section. Those statisticians who adhere to the 
“never pool” rule demand a completely specified model prior to the analysis 
of the experimental data. This position has much to recommend it.. The 
inferences obtained from adopting this point of view will be based upon 
exact sampling distributions, provided that the model that has been specified 
Is appropriate for the experiment. + 4 

Using data from the experiment to revise the model introduces contin- 
gencies which are difficult to evaluate statistically. However, working from 
a revised model which more adequately fits the data may potentially provide 
More powerful tests than those obtained from the “never pool” rule. 
Admittedly the change in power cannot be evaluated with precision. The 
conservative attitude toward pooling adopted in this section attempts to take 
middle ground: one departs from the initial model only if the experimental 
data strongly suggest that the initial model is not appropriate. The posi- 
tion taken by the author in this section is quite close to that adopted by Green 
and Tukey (1960). It is also in line with the point of view developed by 
Bozivich et al. (1956). 


5.17 Individual Comparisons 


Procedures discussed in Chap. 3 for making individual and multiple 
comparisons between means can be extended rather directly to factorial 
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experiments. A significant over-all F test on a main effect, for example, 
indicates that one or more of a multitude of possible comparisons is sig- 
nificant. The specific comparisons which are built into the design or sug- 
gested by the theoretical basis for the experiment can and should be made 
individually, regardless of the outcome of the corresponding over-all F test. 
Seldom, if ever, should a posteriori comparisons be made when the over-all 
F is nonsignificant. (In this context an a posteriori comparison is one 
suggested by deliberate inspection of the data.) Should such comparisons 
be made, statistically significant outcomes should be interpreted with 
extreme caution. The experimenter should not hesitate to describe fully 
all aspects of his experimental results. 

The procedure for making individual comparisons will be illustrated for 
the case of a p х q factorial experiment having n observations per cell. To 
test the hypothesis that о; = «; against the two-tailed alternative hypoth- 
esis «; 52 «y, one may use the test statistic 


"MEE pS 
р 4) 
MS4,-4,, 
The best estimate of MS; ;, depends upon whether factor В is fixed or 
random. For the case in which factor B is fixed, 


М54,-4,- 2М5», сеп сеце 
nq 
For the case in which factor B is random and 025 5 0, 
MS4,-1, = ue 
па 


(When the AB interaction is significant, one is generally interested only in 
the simple main effects. Hence tests on the over-all main effects in the 
presence of significant interaction seldom are made in practice.) For the 
first case, the degrees of freedom for MS ;, ;, are pq(n — 1); for the 
second case, the degrees of freedom are (p — D(g — 1). In either case 


Either one-tailed or two-tailed tests may be made by using the / statistic. 
An equivalent, but computationally simpler form of the F statistic for the 
case in which factor B is fixed, in terms of totals rather than means, is 
— А? 
pa =A) 

2ngMSvw. сеп 
For the case in which factor B is random, 
— (Ai — А)? 


F ^ 
2nqMS,, 
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Regardless of whether factors A or В are fixed or random, a test on the dif- 
ference between cell means has the form 
_ (АВ) — AB, 
2пМ$у. сеп | 

In making the tests indicated in this section, the denominators use data 
from all cells in the experiment, not just those from which the means have 
been computed. If the experimental error is homogeneous, use of infor- 
mation from all cells to estimate experimental error is justified. If, how- 
ever, there is a sufficient number of degrees of freedom (say, over 30) for the 
estimation of experimental error from only those cells which are used in the 
estimation of the means being compared, this latter estimate is to be pre- 
ferred to the pooled estimate from all cells. 

A test on the difference between two means is a special case of a compari- 
Son among several means. For p x q factorial experiments having л 
Observations in each cell, the mean square for a comparison among the 
main effects of factor A has the general form 

(eis + 645 ++ + 4)? 

тс? 
where Ус, = 0. The case in which c, = 1, с = —1, and all other с 
are zero defines the mean square 

(4i = Ау 
2nq 
The case in which c, = 1, c; = 1, and су 2 defines the mean square 
(Ai + А, = 243) 

6nq 

The general mean square corresponding to a comparison among the main 

effects of factor B has the form 
(с.В, + В, +: + BY 
np% 
where Ус, = 0. Similarly, the mean square for a comparison among the 
simple effects of factor A for level 5; has the form 
(cyABy, + AB; + 7: + с,4В,)? | 
nè? 

For the case in which A and B are fixed factors, comparisons planned 
before the data were obtained (and assuming that the number of such com- 
parisons is small relative to the total number possible) can be made by use 
of the following statistic: 


, 


i MScomparison 


F 
M Sw. cell 
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Under the assumption that the hypothesis being tested is true, this statistic 
has a sampling distribution which is given by an F distribution. The 
critical value for this test is F, ,[l, pq(n — 1)]. In cases where the com- 
parisons are large in number or of the a posteriori type, the appropriate 
critical value, as suggested by Scheffé (1960), for tests on main effects due to 


factor A is 

(p = 1)Е,-„[(р — 1), pa(n — 1]. 
An analogous critical value for comparisons among the main effects of 
factor B is 2 

(а — 1), -„[(9 — 1), ра(п — 1)]. 
The critical value for comparisons of the а posteriori type among cell 
means is 

(pq — 1)F,-a[(pq — 1), pa(n — 1)]. 


To illustrate the magnitude of the difference between the two types of 
critical values, consider the case in which р = 10 and рд(л — 1) = 40. For 


= 01 
E : F (1,40) = 5.42 


is the critical value for an a priori type of comparison. Foran a posterior 
type 
УР OF (9,40) = 9(2.45) = 22.05. 


The difference between the two critical values is quite marked—but then the 
difference in the underlying logic is also quite marked. Whenever a 
relatively large number of tests of significance are to be made, or whenever 
comparisons suggested by the data are made, the usual sampling distribu- 
tions (that is, t or F) associated with tests of significance no longer apply. 
The critical value for the a posteriori comparison is much larger because 
the sampling distribution appropriate for this type of comparison must take 
into account sources of variation which are not relevant to the a priori com- 
parison. 

In making comparisons between all possible pairs of ordered means 
within a logical grouping, the Tukey or the Newman-Keuls procedures may 
be adapted for use. For example, all pairs of means within a single row or 
column of a p x q factorial experiment may be compared by means of the 
Tukey procedure. If the number of degrees of freedom for estimating the 
standard error is sufficiently large (say, over 20), a single row or a single 
column of the p x q factorial experiment may be considered as a single- 
factor experiment and the methods in Chap. 3 may be applied directly to 
this part of the factorial experiment. Alternatively, if the assumption of 
homogeneity of within-cell variance is met, an estimate of the standard error 


of a cell mean is given by 
s= ae. cell y 


The degrees of freedom for this estimate are pq(n — 1). 
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If all possible pairs of means corresponding to the main effects of factor 
A are to be compared, and if factor B is fixed, then 


3.4 / М5. сеп 
Sp —— 
nd 
Numerical applications of comparisons of this kind are given in Sec. 6.2. 


5.18 Partition of Main Effects and Interaction into Trend Components 


In some cases arising in practice it is desirable to divide main effects as 
well as interactions into components associated with functional forms 
assumed to account for trends in the criterion responses. As an example, 
considera3 x 4 factorial experiment in which the levels of both factors may 
be regarded as steps along an essentially underlying continuum. The 
magnitudes of the criterion scores within each of the cells may be considered 
to define a response surface. It is frequently of interest to explore regions 
on this response surface, particularly when one is seeking an optimum com- 
bination of treatments. 

Main effects and interactions ina 3 х 4 factorial experiment may be sub- 
divided as indicated below: 


Source df 
A 2 
Linear 1 
Quadratic 1 
B 3 
Linear 1 
Quadratic 1 
Cubic 1 
AB 6 
Linear x linear 1 Quadratic x linear | 1 
Linear x quadratic 1 Quadratic x quadratic 1 
Linear x cubic 1 Quadratic x cubic 1 


The actual equation (or surface) which will describe each of the cell means 
exactly has the form 
X = ey + GA + eB + tA? + cB? + сВ? + АВ + суАВ? + c, AB! 
+ суА?В + сА?В* + c ATP. 
Each term, except су, in the equation corresponds to a term in the partition 


of the 11 degrees of freedom into trend components. For example, АВ? 
Corresponds to the linear x quadratic interaction; A*B* corresponds to the 
quadratic x cubic interaction. 
The components of variation defining the 4 main effects have the general 
form 
(Ec, 40) 
A (component) — nga 


212 DESIGN AND ANALYSIS OF FACTORIAL EXPERIMENTS 


where Ec; = 0. The components of the main effect of factor B have the 
following general form, 
(хс, 


В (component) = ; 
(ош ) np&c? 


where Ус; = 0. The form of the components of the AB interaction is 


(Ecc, AB; 


AB (component) — 
Comp ) nX(c,¢;)* 


where Xe; = 0 and Ec; = 0. The computational work is simplified if the 
levels of factors A and Bare equally spaced along the underlying continuum; 
in this case coefficients of orthogonal polynomials define the с, and с,. 
In case of unequal spacing, these coefficients may be modified by using 
methods described by Robson (1959). 

The components of the interaction obtained in this way describe features 
of the response surface. More extensive methods for exploring response 
surfaces are available. A survey of some of these methods is given in 
Cochran and Cox (1957, chap. 8A). 

If only one of the factors, say, factor B, is continuous, then the following 
analysis is possible: 


Source df 

A 2 

B 3 
Linear 1 
Quadratic 1 
Cubic 1 

AB 6 
Difference in lin trend 2 
Difference in quad trend 2 
Difference in cubic trend 2 


This method of partitioning the interaction is particularly appropriate in 
attempting to interpret differences in shapes of profiles. Computational 
details for this kind of partition are given in Secs. 6.9 and 7.6. Іп Sec. 7.6, 
computational steps are given in terms of a design calling for repeated 
measures. If the design does not have repeated measures, computational 
procedures for treatment effects remain the same but no partition of the 
error variation is required. 

For the case of a 2 x 2 factorial experiment in which both factors repre- 
sent what can be considered an underlying continuum, an exact fit to the 
surface defined by the four cell means is given by the polynomial 


X = ey +A + cB + САВ. 
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For the case of a 2 x 3 factorial experiment, an exact fit to the surface 
defined by the six cell means is given by the polynomial 


X = e + GA + eB + cB? + c,AB + c,AB?. 


The term 4B corresponds to the linear x linear interaction; the term AB? 
corresponds to the linear x quadratic interaction. 

Surfaces defined by polynomials which have as many terms as there are 
cell means are of no predictive value. Zero degrees of freedom remain for 
estimating standard errors; in essence such standard errors are infinitely 
large. The smaller the number of parameters required in defining the form 
of the appropriate response surface, the greater, potentially, the predictive 
value of the resulting equation. 


5.19 Replicated Experiments 


A replication of an experiment is anindependentrepetition underasnearly 
identical conditions as the nature of the experimental material will permit. 
Independent in this context implies that the experimental units in the repeti- 
tions are independent samples from the population being studied. That is, 
if the elements are people, in a replication of an experiment an independent 
sample of people is used in the replication. Inferences made from repli- 
cated experiments are with respect to the outcomes of a series of replications 
of the experiment. Ina sense that will be indicated, inferences from rep- 
licated experiments have a broader scope than do inferences from nonrepli- 
cated experiments. ог 

An experiment in which there are n observations per cell is to be distin- 
guished from an experiment having п replications with one observation per 
cell, The total number of observations per treatment is the same, but the 
manner in which the two types of experiments are conducted differs. Con- 
sequently the relevant sources of variation will differ; hence a change in the 
model for the experiment is needed. Associated with the latter change is a 
different set of expected values for the mean squares. 

In conducting an experiment having п observations per cell, all observa- 
tions in a single cell are generally made within the same approximate time 
interval or under experimental conditionsthatcan beconsidered to differ only 
asa function of experimentalerror. Inap х q factorial experiment having 
n observations per cell (no repeated measures) the total variation may be 
subdivided as follows. 


Source df 
Between cells pg = 1 
Within cells pan — 1) 


In this partition it is assumed that differences between observations within 
the same cell are attributable solely to experimental error. On the other 
hand, differences among cells are attributed to treatment effects and experi- 
Mental error. Should there be systematic sources affecting between-cell 
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variation, other than treatment effects and experimental error, such varia- 
tion is completely or partially confounded with the treatment effects. 

The purpose of a replicated experiment, in contrast to an experiment 
having n observations per cell, is to permit the experimenter to maintain 
more uniform conditions within each cell of the experiment, as well as to 
eliminate possible irrelevant sources of variation between cells, For 
example, the replications may be repetitions of the experiment at different 
times or at different places. Іп this case, sources of variation which are 
functions of time or place are eliminated from both between- and within- 
cell variation. 

Instead of having n replications with one observation per cell, one may 
have two replications in which there are n/2 observations per cell, one may 
have three replications with n/3 observations per cell, etc. The number of 
observations per cell for any single replication should be the maximum that 
will permit uniform conditions within all cells of the experiment and, at the 
same time, reduce between-cell sources of variation which are not directly 
related to the treatment effects. 

As an example of a replicated experiment, suppose that it is desired to 
have 15 observations within each cell of a 2 x 3 factorial experiment. 
Suppose that the experimental conditions are such that only 30 observations 
can be made within a given time period; suppose further that there are, 
potentially, differences in variation associated with the time periods. To 
eliminate sources of variation directly related to the time dimension, the 
experiment may be set up in three replications as follows (cell entries indicate 
the number of Observations): 


Replication 1 Replication 2 Replication 3 

by bs by by b, bs by by bs 
а| 5 5) 5 а 5 5 5 a 5 5 9 
а| 5 5 5 а; 5 5 5 аз 5 5 5 


If the replications are disregarded, there are 15 observations under each 
treatment combination. Within each replication, conditions may be 
relatively uniform with respect to uncontrolled sources of variation ; between 
replications, conditions may not be uniform. In essence, a replicated 
experiment adds another dimension or factor to the experiment—a replica- 
tionfactor. The latter is a random factor. 

In Table 5.19-1 the analysis of a P X q factorial experiment having п 
observations per cell is contrasted with a P X q factorial experiment in 
Which there are r replications of a р X q factorial experiment having n/r 
Observations per cell. In both designs there are л observations under 
treatment combination and a total of npq — 1 degrees of freedom. The 
degrees of freedom in braces in part i are often pooled in a replicated experi- 
ment. Part ii gives an alternative partition of the variation. In the non- 
replicated experiment the between-cell variation defines the estimates of 
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variation due to treatment effects. On the other hand, in the replicated 
experiment part of the between-cell variation is due to replication effects 
as well as interactions with replications. The within-cell variations in both 
designs is considered to be due to experimental error. Since observations 


Table 5.19-1 Comparison of Analysis for Replicated and Nonreplicated 
Factorial Experiments 


p * q factorial experiment with r replications of p x q factorial experiment 
n observations per cell with n/r observations per cell 
P ый df df 
A Pasil A Pu 
B Gil B сен 
АВ (р — 104—1) AB (p = 009 —1) 
; Reps TUI 
i) A x rep (р = I(r = 1) 
Within cell рп — 1) B x rep (gis Dre) 
AB x rep (р = Dq — Dr — 1) 
Within cell palalr) = 1] 
Total пра -1 Total npq — 1 
Between cells pq —1 Between cells рд" —1 
5 Treatments pa -i 
(ii) Reps r-i 
Treat x rep (pq — Vr — 1) 
Within cell pa(n — 1) Within cell parlar) = 1] 


— оа se urged ia EE 


in the replicated experiment are made under somewhat more controlled 
conditions, the within-cell variation in a replicated experiment is potentially 
smaller than the corresponding variation in a nonreplicated experiment. 
The partition of the total variation given in part ii may be illustrated 
numerically. Considera 3 x 4 factorial experiment in which there are to 
be 10 observations under each treatment combination. The design calling 
for 10 observations per cell is contrasted with the design calling for five 
replications with 2 observations per cell in the following partition: 


10 observations per cell 5 reps, 2 observations per cell 


Between cells 11 Between cells 59 
Treatments 11 
Reps 4 
Treat x rep 44 
Within cell 108 Within cell 60 


т Є_`_—„—————— 


In а replicated design of this kind interactions with replications are often 
Considered to be part of the experimental error. However, preliminary 
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tests on the model may be used to check upon whether or notsuch pooling is 
justified. 

There are both advantages and disadvantages to replicated experiments 
when contrasted with those in which there аге л observations percell. The 
advantages of being able to eliminate sources of variation associated with 
replications depend upon the magnitude of the latter relative to treatment 
effects and effects due to experimental error. One possible disadvantage of 
a replicated design might arise in situations requiring precise settings of 
experimental equipment. Considerable time and effort may be lost in 
resetting equipment for each of the replications, rather than making all 
observations under a single set of experimental conditions before moving to 
the next experimental condition. However, the inferences which can be 
drawn from the replicated experiment are stronger when possible variations 
in the resettings are considered as a relevant source of variation in the con- 
duct of the experiment. 


5.20 Тһе Case n = 1 and a Test for Nonadditivity 


Ifthere is only one observation in each cell ofap x q factorial experiment, 
there can be no within cell variation and hence no direct estimate of experi- 
mental error. Among other models, the following two may be postulated 
to underlie the observed data: 

(i) Xi = + a B; + 6. 

(ii) Xi =u o + В, + 2D; + Eiz 

In (i) no interaction effect is postulated, hence all sources of variation other 
than main effects are considered to be part of the experimental error. In 
(ii) an interaction term is postulated. From some points of view the inter- 
action term may be considered as a measure of nonadditivity of the main 
effects. In this context, (i) will be considered the additive model, whereas 
(ii) will be considered the nonadditive model. 

The choice of the scale of measurement for the basic criterion measure- 
ment will to a large extent determine whether (1) or (ii) is the more appro- 
priate model. The degree of appropriateness is gauged in terms of the 
degree of heterogeneity of the experimental error under model (i. Subject- 
matter knowledge and experience gained from past experimentation about 
the functional form of the underlying sources of variation are the best 
guides for deciding between models (i) and (ii). To supplement these 
sources of information, the experimenter may want to use information 
provided by preliminary tests before specifying the model. So far in this 
chapter such tests have been considered only for the case in which direct 
estimates of experimental error were available from the experimental data. 
Tukey (1949) has developed a test applicable to the case in which there is a 


single observation per cell. This test is discussed in detail by Scheffé (1960, 
рр. 130-134). 
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Tukey’s test is called a test for nonadditivity. Its purpose is to help in the 
decision between models (i) and (ii). This test has also been used to choose 
between alternative possibilities for the scale for measurement for the 
criterion, the decision being made in favor of the scale for which (i) is the 
more appropriate model. 

The principle underlying Tukey’s test for nonadditivity is this: Assuming 
model (i) to hold, the variation due to sources other than main effects is 
divided into two parts. One part corresponds to the equivalent to the 
linear x linear component of the AB interaction. This part measures what 
is called nonadditivity. The second part, or balance, is what is left. 
Should the component measuring nonadditivity be significantly larger than 
the balance, a decision is made against model (i). The component associ- 
ated with nonadditivity is defined to be 


УУ (сусу) 
where c, = (4; — G) and c; = (B, — G). An equivalent form for this 
last expression is 


MSnonaaa = 


[xdi — OB; — OX; k 
55,55,/р4 
Because of the way in which the c’s are defined, it will be found that 
ас = (халха). 
In the application of Tukey’s test for nonadditivity to the case of ap х q 
factorial experiment having one observation per cell, the analysis of variance 


takes the following form: 


SSnonaaa == 


Source of variation 55 df MS 

A SS, pail MS, 

B SS, 4-1 MS, 

Residual SS on. (p = 09 = 1) 
Nonadditivity SS 1 MSnonadd 
Balance SSpat (р —1)(4—1)—1 | MSu 


inre ре жән oir Cn prre em SH TUM TS RR 
The test for nonadditivity is given by 


= MSnonaaa 
ЕТТІ 


When this Fratio exceeds the critical value defined by the level of si gnificance 
of the test, the hypothesis that model (1) is appropriate is rejected. „Tukey s 
test for nonadditivity is sensitive to only one source of nonadditivity—that 
associated with what is equivalent to a linear x linear component of an 
interaction. In working with this component there is an implicit assump- 
tion that, the larger the main effects of the individual levels in a treatment 


F 
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combination, the larger the potential interaction effect for the treatment 
combination, if this does exist. This assumption appears reasonable in 
some cases. In other cases it might be that the equivalent of the linear x 
quadratic or the quadratic x quadratic component could more appropri- 
ately be used as a measure of nonadditivity. A numerical example is given 
in Sec. 6.8. 

The principles underlying Tukey’s test can be extended to three-factor as 
well as higher-order factorial experiments. For three-factor experiments 
the measure of nonadditivity is given by 
ГЕ Хе Х,У 

УУУ (сүс ус) 
where с; = 4; — G,c; = В, — G,andc, = C, —G. An equivalent form 
of this last expression is 


SSnonaaa = 


(рат [УУ (сез, Х,)р 
(55,)(55,)(55) ` 
Because of the way in which the c’s are defined, it will be found that 
УУ (осо)? = (Ec) (9$ (Ec). 


SSnonaaa 


5.21 The Choice of a Scale of Measurement and Transformations 


In the analysis of variance of a factorial experiment, the total variation of 
the criterion variable is subdivided into nonoverlapping parts which are 
attributable to main effects, interactions, and experimental error. The 
relative magnitude of each of the corresponding variances depends upon 
the scale of measurement as well as the spacing of the levels of the factors 
used in the experiment. In cases where alternative choices of a scale of 
measurement appear equally justifiable on the basis of past experience and 
theory, analysis in terms of each of the alternative scales is warranted pro- 
vided that each of the scales satisfies the assumptions underlying the respec- 
tive analyses. 

It may happen that within-cell variances will be homogeneous in terms 
of one scale of measurement but heterogeneous in terms of a second scale. 
The within-cell distributions may be highly skewed in terms of one scale but 
approximately normal in terms of a second scale. In terms of one scale of 
measurement, an additive (i.e., no interaction terms) model may be ap- 
propriate, whereas in terms of a second scale the additive model will not be 
appropriate. , 

In determining the choice of a scale of measurement for the observed 
data, two cases will be contrasted. In one case, a priori theory and experi- 
ence determine the appropriate model as well as the appropriate scale. In 
the second case, where there is neither adequate theory nor experience to 
serve as guides, the appropriate model and the proper scale of measure- 
ment are determined only after the experimental data have been partially 
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analyzed. In the latter case the design of the experiment should provide 
the experimenter with sufficient data to permit the evaluation of alternative 
formulations of the model. 

A readable summary of methods for determining appropriate transforma- 
tions will be found in Olds et al. (1956). А series of alternative methods is 
also given by Tukey (1949). A transformation in this context is a change in 
the scale of measurement for the criterion. For example, rather than time 
in seconds the scale of measurement may be logarithm time in seconds; 
rather than number of errors, the square root of the number of errors may 
be used as the criterion score. There are different reasons for making such 
transformations. Some transformations haveas their primary purpose the 
attainment of homogeneity of error variance, The work of Box (1953) 
has shown that the distribution of the F ratio in the analysis of variance is 
affected relatively little by inequalities in the variances which are pooled into 
the experimental error. Transformations which have homogeneity of 
error variance as their primary purpose are relatively less important than 
they were formerly considered to be. With regard to the usual tests for 
homogeneity of error variance, Box (1953, p. 333) says, “To make the pre- 
liminary tests on variances is rather like putting to sea in a rowing boat to 
find out whether conditions are sufficiently calm for an ocean liner to leave 
port.” 

Another reason for using transformations is to obtain normality of 
within-cell distributions, Often non-normiality and heterogeneity of vari- 
ance occur simultaneously. The same transformation will sometimes 
normalize the distributions as well as make the variances more homogeneous 
The work of Box (1953) has shown that the sampling distribution of the F 
ratio is relatively insensitive to moderatedeparturesfromnormality. Hence 
transformations whose primary purpose is to attain normal within-cell 
distributions are now considered somewhat less important than was the case 
previously. 

A third reason for transformations is to obtain additivity of effects. In 
this context additivity of effects implies a model which does not contain inter- 
action terms, Та some of the designs which are discussed in later chapters, 
a strictly additive model is required. In designs of this type certain inter- 
action effects are completely confounded with experimental error. For 
those designs which do permit independent estimation of interaction effects 
and error effects, the strictly additive model is not essential There are, 
however, advantages to the strictly additive model, if it is appropriate, over 
the nonadditive model—particularly in cases in which fixed and random 
factors are in the same experiment. The interaction of the random factors 
with the fixed factors, if these exist, will tend to increase the variance of the 
sampling distribution of the main effects. Tukey (1949) has pointed out 
rather vividly the influence of the scale of measurement upon existence or 
nonexistence of interaction effects. 
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The use of transformations in order to obtain additivity has received more 
attention in recent works than it has in the past. Tukey's test for non- 
additivity has been used, in part, as a guide for deciding between alternative 
possible transformations. It is not possible to find transformations which 
will eliminate nonadditivity in all cases. In some cases there is an intrinsic 
interaction between the factors which cannot be considered a function of 
the choice of the scale of measurement. These cases are not always easily 
distinguished from cases in which the interaction is essentially an artifact 
of the scale of measurement. 

A monotonic transformation is one which leaves ordinal relationships 
(і.е., greater than, equal to, or less than) unchanged. If the means for the 
levels of factor A have the same rank order for all levels of factor B, then a 
monotonic transformation can potentially remove the A x B interaction. 
When such rank order is not present, a monotonic transformation cannot 
remove the A x B interaction. Only monotonic transformations will be 
discussed in this section. Nonmonotonic transformations in this connec- 
tion would be of extremely limited utility; the author is aware of no studies 
in which the latter class of transformation has been used. 

There are some over-all guides in selecting a scale of measurement which 
will satisfy the assumptions of homogeneity of error variance. These 
guides will be considered in terms of the relationship between the cell means 
and the cell variances. The latter relationship will be presented in terms of 
a p х q factorial experiment; the principles to be discussed hold for all 
designs. 

Case (i): оў = Hj. In this case the cell variances tend to be functions 
of the cell means: the larger the mean, the larger the variance. This kind of 
relationship exists when the within-cell distribution is Poisson inform, For 
this case, a square-root transformation will tend to make the variances more 
homogeneous. This transformation has the form 


Xin = VOTES 


where X is the original scale and X" is the transformed scale. If Y is a 
frequency, i.e., number of errors, number of positive responses, and if X 
is numerically small in some cases (say, less than 10), then a more appro- 
priate transformation is 


Xin = Xa + AX +1. 


The following transformation is also used for frequency data in which some 
of the entries are numerically small: 


Xin = JXin + 4. 


Either of the last two transformations is suitable for the stated purpose. 
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Case (ii): o7; = a(l — ди). This case occurs in practice when the 
basic observations have a binomial distribution. For example, if the basic 
observations are proportions, variances and means will be related in the 
manner indicated. The following transformation is effective in stabilizing 
the variances, 

Ж 2'atcsin Xue 


where X;,,,isa proportion. In many cases only a single proportion appears 
in a cell. Tables are available for this transformation. Numerically, 
Xj,isan angle measured in radians. For proportions between .001 and 
.999, X; assumes values between .0633 and 3.0783. The notation sin? 
(read inverse sine) is equivalent to the notation arcsin. For values of X 


close to zero or close to unity, the following transformation is recommended, 
Xix = 2 arcsin YX i + (1n), 


where л is the number of observations on which X is based. 
Case (iii): o?, = К?н. In this case the logarithmic transformation will 
Stabilize the variances. 
Xim = log Хуу. 


To avoid values of Y close to zero, an alternative transformation 
ук = log (Xi + 1) 


is often used when some of the measurements are equal to or close to zero. 
The logarithmic transformation is particularly effective in normalizing dis- 
tributions which have positive skewness. Such distributions occur in 
psychological research when the criterion is in terms of a time scale, i.e., 
number of seconds required to complete a task. | 

Use of the Range in Deciding between Alternative Transformations. In 
deciding which one of several possible transformations to use in a specific 
problem, one may investigate several before deciding which one puts the data 
in a form that most nearly satisfies the basic assumptions underlying the 
analysis of variance. The use of the range statistic (or the truncated range 
Statistic) provides a relatively simple method for inspecting the potential 
usefulness of several transformations with a minimum of computational 
effort. An example given by Rider et al. (1956, pp. 47—55) will be used to 
illustrate the method. ; 

In this example, eight operators individually measured the resistance of 
each of four propeller blades with each of two instruments. Order was 
randomized. The range of the 16 measurements on each blade in terms of 
the original scale as well as in terms of transformed scales is given below. 
(Only the end points of the ranges in terms of the original scale need to be 
transformed in order to obtain the following data.) 


222 DESIGN AND ANALYSIS OF FACTORIAL EXPERIMENTS 


Scale of measurement 


Blade - - 
Original | Square root | Logarithm | Reciprocal 

1 310 | 061 021 0.077 

2 010 | 0.08 0.12 0.833 

3 0.15 0.12 0.17 1.111 

4 11.00 1.01 0.16 0.015 


The logarithmic transformation is seen to make the ranges more uniform. 
In many practical cases the range tends to be proportional to the variance. 
A transformation which tends to make the ranges uniform will also tend to 
make the variances more uniform. As a further step in checking on the 
adequacy of the logarithmic transformation, the authors applied Tukey’s 
test for nonadditivity to each of the four interaction terms. On the original 
scale of measurement, two of the four F ratios for nonadditivity were signifi- 
cant at the 5 per cent level. None of the F ratios for nonadditivity was 
significant at the 5 per cent level in terms of the logarithmic scale. 


5.22 Unequal Cell Frequencies 


Although an experimental design in its initial planning phases may call 
for an equal number of observations per cell, the completed experiment may 
not have an equal number of observations in all cells. There may be many 
reasons for such a state of affairs. The experimenter may be forced to work 
with intact groups having unequal size; the required number of individuals 
in a given category may not be available to the experimenter at a specified 
time; subjects may not show up to complete their part in an experiment; 
laboratory animals may die in the course of an experiment. If the original 
plan for an experiment calls for an equal number of observations in each 
cell, and if the loss of observations in cells is essentially random (in no way 
directly related to the experimental variables), then the experimental data 
may appropriately be analyzed by the method of unweighted means. In 
essence the latter method considers each cell in the experiment as if it con- 
tained the same number of observations as all other cells (at least with regard 
to the computation of main effects and interaction effects). 

Under the conditions that have been specified, the number of observations 
within each cell will be of the same order of magnitude. The procedures for 
an unweighted-means analysis will be described in terms of a 2 x 3 fac- 
torial experiment. These procedures may be generalized to higher-order 
factorial experiments. The number of observations in each cell may be 
indicated as follows: 
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The harmonic mean of the number of observations per cell is 
Pq 
(1/ny) + (лы) + > ++ + (L/n,,) 
In the computation of main effects and interactions each cell is considered 
to have 7, observations. (The harmonic mean rather than the arithmetic 
mean is used here because the standard error of a mean is proportional to 
Цп,; rather than л,;.) 
The mean for each of the cells may be represented as follows: 


b b ы 


л, 


a АВ, АВ АВ, 
а, | АВ, АВ АВ, 
The estimate of the mean ш; is 
DAB; 


fas 
q 


That is, 4, is the mean of the means in row a;, not the mean of all observa- 
tionsatlevela,. These two means will ditfer when each cell does not have 
the same number of observations. The estimate of y, is 


УАВ, 


р 
Again there will be a difference between the mean of the means within a 
column and the mean of all observations іп а column. The grand mean in 
the population is estimated by 
УА, _ XB, _ XXAB, 
pi. p 
Variation due to main effects and interactions are estimated by the follow- 
ing sums of squares: 
SS, = iaX(4, — 0), 
SS, = npX(B; — Gy, 
SS, = fi, (ABs = А, a В, ns оу. 
These sums of squares have the same formas corresponding sums of squares 
for the case of equal cell frequencies. However, 4,, B;, and Gare computed 
ina different manner. Ifallcell frequencies were equal, both computational 
procedures would lead to identical results. 
The variation with cell ij is 4 
(zx А) 


SS, = XXin == 
т 


ni 
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The pooled within-cell variation is 
SSw. сеп = ZXSS,;. 
The degrees of freedom for this latter source of variation are 
dfw, сеп = (ZXn;) — pq. 

Other methods are available for handling the analysis for unequal cell 
frequencies. If, however, the differences in cell frequencies are primarily 
functions of sources of variation irrelevant to the experimental variables, 
there are no grounds for permitting such frequencies to influence the esti- 
mation of population means. On the other hand, should the cell fre- 
quencies be directly related to the size of corresponding population strata, 
then such frequencies should be used in estimating the mean of the 
population composed of such strata. 


5.23 Unequal Cell Frequencies—Least-squares Solution 


The general least-squares solution for the sums of squares in a factorial 
experiment is computationally more difficult than the corresponding solu- 
tion for the unweighted-means analysis. There is some evidence to indicate 
that the resulting tests, in the least-squares case, are the more powerful. 
Suppose that the following linear model is appropriate for the first stage of 
the estimation process: 

(1) АВ, = p + a + By + s. 
The parameters on the right-hand side of (1) satisfy the structural require- 
ments of linear model I. For the case of unequal cell frequencies, the 
restrictions on the parameters may be taken to be 

Znj«,—0 and Хп, =0. 


Let the least-squares estimators of the corresponding parameters be desig- 
nated by /i, ĉ, and fj. These estimators are determined in such a manner 
as to make 


(2) XX(AB,, — й — 8, — fj! = minimum, 
subject to the restrictions 
En4,—0 and  Xnjf,—0. 

The restrictions in (1) and (2) are introduced for computational con- 
venience; they have no effect upon the resulting sums of squares. The 
same sums of squares would result if the restrictions had the form 

X4,—0 and  2f,—0. 
By differentiating (2) with respect to ñ, the least-squares equation defining 
fis 
G 


(3) 4-4. 
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The least-squares equations defining the й, can be shown (Kempthorne, 
1952, p. 80) to have the following form: 


nub, + тыб» + `` + пй, = Bi, 
(4) nafi + nape +++ + napa = Boy 
пй, + паба. + Ngaba = Bo 


The set of Eqs. (4) is analogous to the set of normal equations encountered 
in multiple regression. In the present case, the ///5 have the role of regres- 
sion coefficients, the n;;'s the role of covariances among the predictors, and 
the B/'s the role of covariances between predictor and criterion. 

The coefficients of the f//s are defined as follows: 


2 2 2 
Nir | Пэк Nok 
hn = mc [E gg 


1, "ә. Ny, 
пуп. пп пт 
п = —[ etin "не ү... atom Е 
ny, п», Ny, 


Since this last expression is symmetric in k and m, 
Nim = Mme 
The terms on the right-hand side of (4) are defined by 


pi = pe | + Пу4; жез tue], 
; у ny, Ng, ny, 


B; is called the adjusted total for the observations at level Б, 

Several procedures exist for solving the set of Eqs. (4) for the Й. „Itis 
not, however, necessary to obtain the individual f,'s in order to obtain the 
sum of squares associated with the f,’s. Ап example of a computational 
procedure for obtaining this sum of squares in this manner is given in Sec. 
6.14. If the individual 0/8 are obtained, then 
(5) SSyaaj) = Уй,В,. 

A general method for obtaining the Bs is described by Kempthorne (1952, 


р. 81). nde 
A set of simultaneous equations defining the @,’s has a form which is 


similar to (4), i.e., 
пл + Mga + +++ + Miti = Ai, 4724572 
In this case, 


2 2 2 
п п Па 
ELE aa 3 


Па, ns 
ШАП у Maal me Neama 
nb kimi | бтз |... ым), 
ny na na 


B, , mB.: ny,B, 
AA 1 me ЕН 
ny Па 4 


226 DESIGN AND ANALYSIS OF FACTORIAL EXPERIMENTS 


However, if SS,,,5 is known, SS,(aa;, may be obtained without the use of 
(5). This can be done as follows. 
Define the unadjusted sums of squares as 


2 2 

ss, = x(4) Eg 
ША n, 

2 2 

55, = z(2)- =i 
ny n. 


2 2 
SScens = xz (452 )- T 


nj n. 
The following relationship may be shown to hold: 
(6) SS, zz SSiaaj == SSayaajy zn SS,, 
Hence, if SS,,,5 has been computed, then 
SSaraajy) = SS, + SSiaaj — SS,. 

The minimum value that (2) obtains defines the residual variation among 
cells, i.e., that part of the variation among the cell means that cannot be 
predicted from the least-squares estimators of the main effects due to factors 
Aand B. This residual variation is actually that due to the AB interaction. 
It can be shown that 
(7) SSres trom celts = $$араај) = 5бееп» — SS, — 55ңадй)). 
Equivalently, 


SSres trom cells = SSav(aajy = SSceus SSavaai SS,. 

If the &,’s and й; have been computed, then the interaction sum of squares 
is given by 

SSasiaajy = SScens — X&A; — Xf;B;. 
Since %, and DB, are determined by a procedure which minimizes this quantity, 
SS, аар is the least-squares estimate of the variation due to interaction. 

1 There are computational methods for obtaining SS,,,4 without having 

either the &,’s or the f/s (see Rao, 1952, рр. 96-98). Given $5 лау, one 
has 

SSaçadj) mar SSeens = SSadiaaiy x SS,, 

SSyaaj; = SSceus — SSasiaajy — SS,- 
A numerical example of these computational procedures is given in Sec. 
6.14. 
. The within-cell variation is defined in the same manner as that described 
intheunweighted-meansanalysis. Over-all tests follow the same procedure 
as that used in the case of equal cell frequencies. Should the interaction be 
significant, primary interest is in the simple main effects rather than the over- 
all main effects. Analysis of the simple main effects takes the form of a 
single-classification analysis of variance having unequal cell frequencies. 
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The complete set of normal equation obtained from (2), for the special 
case of a 2 x 3 factorial experiment, may be represented schematically as 
follows. 


йа 8 Ê b Bs 


XXn; n, ng Ny ng Ng G 
т. m, O т Me т Ay 
п. 0 п», May Mag Mag | _ Ag 
Ny Tat па Bh Ome В, |' 
Ng с Па ПОН л. 9 В, 
пз Dg Па 0 0 Пп, В. 


The normal equations given by (3) and (4) represent subsets of (һе above 
complete set. The subset (3) is obtained by “sweeping out” the effects of 
factor 4. In the above schematic representation, the second equation has 
the form 


ny fi + my & + 08, + пуу + тә» + таб = A). 


CHAPTER 6 


Factorial Experiments—Computational 
Procedures and Numerical Examples 


6.1 General Purpose 


In this chapter the principles discussed in Chap. 5 will be illustrated by 
numerical examples; detailed computational procedures will be given for a 
variety of factorial experiments. It should be noted that the formulas con- 
venient for computational work are not necessarily those which are most 
directly interpretable. For purposes of interpretation the basic definitions 
given in Chap. 5 аге the important sources for reference; however, in all cases 
the computational formulas are algebraically equivalent to the basic 
definitions. The algebraic proofs underlying this equivalence are not 
difficult. Factorial experiments in which there are repeated measures are 
discussed in Chap. 7. 


6.2 p х q Factorial Experiment Having л Observations per Cell 


The treatment combinations in this type of experiment are represented in 
the cells of the following table: 


ы (үз. С Бу өнеді, <br 
а | aby аба co aby cc абы 
а | aba аба +++ аб, с>: абы 
а; Gb аба -:: aby e abi, 
а, | aby, abp `` ав) c abp 


The cell in row a, and column 5, corresponds to that part of the experiment 
in which treatment a; is used in combination with b; to yield the treatment 
228 
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combination аб, The first subscript in this notation system refers to the 
level of factor A, the second subscript to the level of factor B. 

Within each cell of the experiment there are л observations. The obser- 
vations made under treatment combination ab, may be symbolized as 
follows: 


The symbol X;, denotes a measurement on the first element in this cell. 
The symbol Х;» denotes a measurement on the second element in this cell. 
The measurement on element К is denoted by the symbol АХ. The sub- 
script k assumes the values 1, 2, . . . , within each of the cells. The sum of 
the n observations within cell ab; will be denoted by the symbol АВ,,; thus, 


AB, = DX ine: 
k 


The following table summarizes the notation that will be used for sums of 
basic measurements: 


=> 
ы by b; b, Row sum T 
a АВ) АВ! "f AB; p АВ, | A, = ХАВ! 

d, АВ, |: ABgY Темп САВЫ vd! A Ber 301i 
4; AB АВ» 504 АВ); с АВ, A; 

а АВ АВ, pua AB); us AB,, ^» ^ 
Column sum В, By ед. В, ... В, G 

Dees ee 


The sum of the ng measurements in row i, that is, the sum of all measure- 
ments made at level a,, is denoted by the symbol 4,. Thus, 


A, = JAB; = УХХ. 
The double summation symbol УУ indicates that one sums within each cell 
ee 


as well as across all cells in row i (Summing over the subscripts j and К is 
equivalent to summing over all observations within a given row.) Thesum 
of the np measurements in column j, that is, the sum of all measurements 
made under level 5;, is denoted by the symbol В). Thus, 


B;= АВ, = ХХХ 
i i 
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The grand total of all measurements is denoted by the symbol G. Thus, 


gU ХА, = 28; = 254В,; = УУУХ, 


i jk 


The mean of all measurements under treatment combination ab,, is 


== B,, 
ap, айы. 
n 
The mean of all measurements at level a, is 
eq eee 
A,=—. 
ng 
The mean of all measurements at level b; is 
A ch 
np 
The grand mean of all observations in the experiment is 
бы 
npq 


To summarize the notation, a capital letter with a subscript ijk represents 
an individual observation; a pair of capital letters with the subscript ij 
represents the sum over the п observations represented by the subscript К. 
A capital letter with the subscript i represents the sum of zig observations 
within a row of the experimental plan; a capital letter with the subscript j 
represents the sum of the np observations within a column of the experi- 
mental plan. A widely used equivalent notation system is summarized 
below: 


Notation Equivalent notation 
Хук Хук 
АВ; Xij. 
Ai Xi.. 
B; Xj. 
G x 


In the equivalent notation system, the periods indicate the subscript over 
which the summation has been made. 

Definition of Computational Symbols. In order to simplify the writing 
of the computational formulas for the sums of squares needed in the analysis 
of variance, it is convenient to introduce a set of computational symbols. 
The symbols appropriate fora p. x q factorial experiment having n observa- 
tions per cell are defined in Table6.2-1. The computational procedures for 
this case are more elaborate than they need be, but the procedures to be 
developed here can be readily extended to more complex experimental 
plans in which they are not more elaborate than they need be. 

In using the summation notation, where the index of summation is not 
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indicated it will be understood that the summation is over all possible sub- 
scripts. For example, the notation Y Y Y X5, will be abbreviated УЛ; 


i k 
similarly the notation Y (4B,;)* will be abbreviated У(АВ,,)?. Where the 


1 
summation is restricted to a single level of one of the subscripts, the index 
of summation will be indicated. Thus, 


X(AB,Y = ABT; + АВ», ++ + AB} + +++ АВ), 
X(AB.? = АВ + АВ + °° + + АВ + >> + ABS 


In the definitions of the computational formulas given in part i of Table 
6.2-1, the divisor in each case is the number of observations summed to 


Table 6.2-1 Definition of Computational Symbols 


(1) = С?[прд (3) = AD _ 
(i) (2) = ХХ), (4) = (2B5)/np 
(5) = [2048,,)/л 


a) ox 
Source of variation Computational formula for SS 


A Ss, = (3) — (1) 
B SS, = (4) — (1) 
(i) AB SS,» = (5) — (3) — (4) + (1) 
Experimental error 
(within cell) SSerror. = (2) — (5) 
Total 55ыш = (2) — (1) 


obtain one of the terms that is squared. For example, n observations are 
summed to obtain an AB,; Hence the denominator n in the term 
[X(AB,)]n. There are nq observations summed to obtain an А1. Hence 
the denominator ng in the term (247)/nq. _ rae: 
An estimate of the variation due to the main effects of factor А is given by 

SS, = ngX(A, — Gy. 
This is not a convenient computational formula for this source of variation. 
An algebraically equivalent form is 

yv 42 2 
PON Wed qq 


ss, = = — 
nq npq 
The other sources of variation in the analysis of variance are as follows: 
SS, = npX(B; — 6) = (4) — (1), 
SS,, = n(ZAB,, — А, — Bj + б) = (5) —  — (4) + (1). 
SSy сеп = X(X,5, — АВ)» = (2) - (5), 
SStotar = E(X iz — G)? = (2) — (1). 


These computational formulas are summarized in part ii of Table 6.2-1. 
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Should the interaction term in the analysis of variance prove to be statis- 
tically significant, it is generally necessary to analyze the simple main effects 
rather than the over-all main effects. Computational symbols for variation 
due to simple main effects are summarized іп Table 6.2-2. By definition the 


Table 6.2-2 Definition of Computational Symbols 
for Simple Effects 


(3а) = .4 4 
Gay) = Ала 


(3a,) = Ат 


(4b,) = Впр 
(4b) = B3/np 


(46,) = Впр 


(3) = CA) 


(4) -(“В)/пр 


(i) 
Gay) = [DXXABylIn (56) = (Ават 
Ga) = DX Byj'lln (Sb) = [DXGLBi Yn 
(az) = BAB, )]/n (55) = ГУ(АВ,4/п 
(5) = [X(AB;j]/n (5) = [S(AB;;)"I/n 
Computational 
Source of variation formula for SS 
Simple effects for A: 
For level b, (55) — (45) 
For level b, (55) - (4ba) 
(ii) For level 5, (5) E а 


Simple effects for B: 
For level a, 
For level a, 


(5а) - Ga) 
(Say) — Gay) 


(Sa,) — (3a,) 


For level a, 


variation due to the simple main effect of factor A for level b, is 
SS, гогь, = nX(AB4, — Bj 


2 2 
= ADU DIIS цр. 
np 


n 
By definition, the variation due to the simple main effect of factor B for 
level a, is 
55, гога, = nX(AB,; — А)? 


Х(А 2 ү? 2 
-HABD Ж (a) — (зау. 
п па 
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Numerical Example. A numerical example of a 2 x 3 factorial 
experiment having three observations per cell will be used to illustrate the 
computational procedures. Suppose that an experimenter is interested in 
evaluating the relative effectiveness of three drugs (factor B) in bringing 
about behavioral changes in two categories, schizophrenics and depressives, 


Table 6.2-3 Numerical Example 


Observed data: 


() Category a, 8 4 0 10 8 6 8 6 4 


Category a, | 14 10 6 4 2 0 15 12 9 
= I = 
AB summary table: 


(i) а, 12 24 18 54 = A, 


(1) = (126/18 882.00 
(2) = (82 + 42 +02 +--+ + 15° + 12? + 9%) = 1198 
(ш) (3) = (542 + 7229 = 900.00 
(4) = (422 + 30? + 54)/6 930.00 
(5) = (122 + 24 + 18% + 30° + 6? + 36°)/3 = 1092.00 


y www 


SS, — (3) — (1) — 900.00 — 882.00 — 18.00 

А SS, = (4) — (1) = 930.00 — 882.00 = 48.00 
Giv) 55,6 = (5) — (3) — (4) + (0) 

= 1092.00 — 900.00 — 930.00 + 882.00 = 144.00 

SSw. cen = (2) — (5) = 1198 — 1092.00 = 106.00 

SStotar = (2) - (1) = 1198 - 882.00 = 316.00 


of patients (factor A). What is considered to be a random sample of nine 
patients belonging to category а, (schizophrenics) is divided at random into 
three subgroups, with three patients in each subgroup. Each subgroup is 
then assigned to one of the drug conditions. An analogous procedure is 
followed for a random sample of nine patients belonging to category a, 
(depressives). Criterion ratings are made on each patient before and after 
the administration of the drugs. The numerical entries in part i of Table 
6.2-3 represent the difference between the two ratings on each of the patients. 
(An analysis of covariance might be more appropriate for this plan; co- 
Variance analysis is discussed in Chap. 11.) 
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Asa first step in the analysis, the AB summary table in part iiis obtained. 
The entry in row a, column b; is 


AB, — 84-4 +0 = 12. 
The entry in row a, column 5; is 
AB, = 15 + 12 +9 = 36. 


Data for all computational symbols except (2) are in part ii; computational 
symbol (2) is obtained from data in part i. The analysis of variance is 
summarized in Table 6.2-4. 


Table 6.2-4 Summary of Analysis of Variance 


Source of variation SS | df | MS F 
A (category of patient) 18.00 1 18.00 2.04 
B (drug) 48.00 2 24.00 2.72 
AB 144.00 2 72.00 8.15 
Within cell 106.00 олын 8.83 

Total 316.00 | 17 | 


„——————————— 


The structure of the F ratios used in making tests depends upon the ех- 
pected values of mean squares appropriate for the experimental data. 1 
the categories and drugs are fixed factors, i.e., if inferences are to be made 
only with respect to the two categories of patients represented in the ex- 
periment and only with respect to the three drugs included in the experiment, 
then the appropriate expected values of the mean squares are given in Table 
6.2-5. The structure of F ratios is determined in accordance with the 
principles given in Sec. 5.7. 


Table 6.2-5 Expected Values for Mean Squares 


Source MS | E(MS) F 
Main effect of 4 | MS, | в? + 90} | MSa/MSw. cen 
Main effect of B | MS, o? + 602 | MS,/MSvw. сей 
AB interaction MSas 6? + 362, | MSqo/MSw. cen 
Within cell MSy. сей в? 


If tests are made at the .05 level of significance, the critical value for the 
test of the hypothesis that the action of the drugs is independent of the cate- 
gory of patient (і.е., zero interaction) is Ғ,(2,12) = 3.88. In this case 
Е = 8.15 exceeds the critical value. Hence the experimental data do not 
support the hypothesis of zero interaction. The data indicate that the 
effect of a drug differs for the two types of patients—the effect of a drug 
depends upon the category of patient to which it is administered. A 
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significant interaction indicates that a given drug has different effects forone 
category of patient from what it has for a second category. The nature of 
the interaction effects is indicated by inspecting of the cell means. These 
means are given below: 


Drug! | Drug2 | Drug 3 


Category 1 4 8 6 
Саїерогу 2 10 2 12 


A geometric representation of these means is given in Fig. 6.1. This 
figure represents the profiles corresponding to the simple effects of the drugs 
(factor B) for each of the categories (factor A). A test for the presence of 
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Drug b, Drug b; Drug b3 


Figure 6.1 Profiles of simple effects for drugs. 


interaction is equivalent to a test on the difference in the shapes of the pro- 
files of these simple effects. An equivalent geometric representation of the 
table of means is given in Fig. 6.2. This figure represents the profiles cor- 
responding to the simple effects of the categories for each of the drugs. 
The profile for drug 2 appears to have a slope which is different from the 
Slopes of the profiles for the other drugs. Asan aid in the interpretation of 
interactions, geometric representation of the profiles corresponding to the 
means is generally of considerable value. r 

Tests on differences between means within the same profile are given by 
tests on simple effects. Computational procedures for obtaining the varia- 
tion due to simple effects are summarized in Table 6.2-6. Data from which 
the symbols in part ii are obtained are given in part i; the latter is the AB 
Summary of Table 6.2-3. 
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The analysis of variance for the simple effects of the drugs for each of the 
categories is summarized in Table 6.2-7. The structure of the F ratios for 
simple effects is dependent upon appropriate expected values for the mean 
squares. Assuming that factors А and В are fixed factors, expected values 
for the mean squares of simple effects are given in Table 6.2-8. Tt should 
be noted, in terms of the general linear model, that a simple main effect is 
actually a sum of an over-all main effect and an interaction. For example, 


AB,;—B; estimates 0; + «fij; = а; for b;. 
A test on the simple effect of factor A for level Р, is equivalent to a test 


that ož г.ь, = 0. The appropriate F ratio for this test would have 
an estimate of o? as denominator; the latter estimate is given by MS,, гш. 


12 S pz Drug 3 
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Figure 6.2 Profiles of simple effects for categories. 


By using the .05 level of significance for the data in Table 6.2-7, the critical 
value for an F ratio is Ғ,,(2,12) = 3.88. The experimental data indicate 
no difference between the drugs for category a, (Fos = 1.36). The experi- 
mental data indicate that there are differences between the drugs for cate- 
gory as (Fws = 9.51). Inspection of the profile of the drugs for category 
a; in Fig. 6.2 indicates that the effect of drug Б, is different from the effects 
of the other drugs. 

Returning to the analysis in Table 6.2-4, neither of the main effects is 
statistically significant. However, in the presence of interaction, inferences 
made with respect to main effects must be interpreted with caution. The 
data indicate that the main effects due to drugs do not differ. However, 
this does not mean that the drugs are equally effective for each of the cate- 
gories considered separately. Because of the significant interaction effect, 
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Table 6.2-6 Computation of Simple Effects 


AB summary table: А 
ы by b, Total 


(i) aul TRAC LR TS ERES 
a |3 6 35 т 


42 30 54 126 


(За) = (5429 = 324.00 (45) = (422)/6 = 294.00 
(Заз) = (72®)]/9 = 576.00 (4b) = (302)/6 = 150.00 
(4b) = (5456 = 486.00 
(ii) (3) = 900.00 (4) = 930.00 
(5a) = (122 + 24? + 182)/3 = 348.00 (5b,) = (122 + 30%)/3 = 348.00 
(Say) = (30? + 6° + 36%)/3 = 744.00 (5b) = (24° + 6%)/3 = 204.00 
(5%) = (182 + 362)/3 = 540.00 
(5) = 1092.00 6) = 1092.00 
Simple effects of A: 
For level 5, SS, tor b, = (5b) — (46) = 54.00 
For level 5, SS, for b, = (50) — (4р) = 54.00 
For level 0; SS, for b, = (50) - (45) = 54.00 
ss 162.00 
(iii) Simple effects of B: 
For level a; SSp for a, = (ба) - (За) = 24.00 
For level as 55) for а, = (5а) - (Заз) = 168.00 
192.00 
Check: SS, + SS, = SS, for b; Check: SS, + SS, = XSS, for a, 
18.00 + 144.00 = 162.00 48.00 + 144.00 = 192.00 


Table 6.2-7 Analysis of Variance for Simple Effects 
Source of variation 55 df MS F 


B for a, (drugs for category ау) 24.00 2 12.00 | 1.36 
В for ay (drugs for category аз) | 168.00 2 8400 | 9.51 
Within cell 106.00 | 12 8.83 
—-—-—-+: el 
Table 6.2-8 Expected Values for Mean Squares 
of Simple Effects 


Source of variation E(MS) 

Simple effects of factor A: 

For level Б 02 + 30% for by 

For level by of + 30% for by 

For level bg оў + 305 for by 
Simple effects for factor B: 

For level а, 02 + 308 for a, 

For level a; о? + 305 for dy 


OE 
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conclusions with respect to main effects due to drugs cannot be applied 
separately to each of the categories. 

The drugs are differentially effective only when one considers comparisons 
within each of the categories separately. It must be noted and emphasized 
that statistical inference is a tool to help the experimenter in drawing 
scientifically meaningful conclusions from his experiment. Too frequently 
the tool is allowed to become the master rather than the servant. The 
primary objective of the experiment should determine which tests are to be 
made and which tests are to be avoided for lack of meaning. Statistical 
elegance does not necessarily imply scientifically meaningful inferences. 

Individual Comparisons. To make comparisons between two or more 
means, the procedures given in Sec. 5.17 may be used. For example, to test 
the hypothesis that drugs 1 and 2 are equally effective for category 2, one 
may use the following statistic (data from the last numerical example will 
be used for illustrative purposes): 


_ (ABa — AB»)? 
2nMSw. сеп 


кс уы 
6(8.83) 

For a .05-level test, the critical value is F,,(1,12) = 4.75. Hence the 
hypothesis that 4) = иу» is not supported by the experimental data. 

Totest differences between all possible pairs of means in a /ogical grouping 
of means, the procedures given in Sec. 3.9 may be adapted for use. The 
Newman-Keuls procedure will be illustrated here. Suppose that it is 
desired to test the differences between all possible pairs of means for category 
45 ao procedures for such multiple comparisons are outlined in Table 

In part i the means to be compared are first arranged in rank order, from 
low to high. Then differences between all possible pairs which give a posi- 
tive value are obtained. An estimate of the standard error of a single mean 
is computed in part ii. This estimate is based upon data in all cells in the 
experiment, not just those from which the means are obtained. (If there is 
any real question about homogeneity of variance, only cells from which 
the means are obtained may be used in estimating the standard error of a 
mean.) If the within-cell variance from all cells in the experiment is used, 
degrees of freedom for sẹ are pg(n — 1), which in this case is equal to 12. 

To obtain critical values for a .05-level test, one obtains values of the 
Y.os(r,12) statistic, where 12 is the degrees of freedom for зу and ғ is the 
number of steps two means are apart in an ordered sequence. ` These values 
are obtained from tables of the q statistic in Table B.4. The actual critical 
values аге s¢q.9;(r,12). For example, the difference 


АВ, = АВ, = 10. 


р X 4 FACTORIAL EXPERIMENT 239 
These means are three steps apart in the ordered sequence (r — 3); hence 
the critical value is 6.45. The difference 
АВ, "ri АВ, = 2, 
for which r = 2, has the critical value 5.27. Tests of this kind must be made 
in a sequence which is specified in Sec. 3.9. 


Table 6.2-9 Comparisons between Means 
for Category a, 


АВ, АВ, АВ, 
Ordered means: 2 10 12 
© АВ, АВд АВ» 
Differences: АВ» — 8 10 
АВ» ст; 
шие --- 
(ii) sy = VMS, сешп = V8.83/3 = 1.71 
НЕЕ СЕЕ ЕНЕ 
9.5(7,12): 208 ӘЛ! 
5.09. 50712): 527 645 
(iii) АВ, АВд ABa 
ж ж 


АВ 
АВ - 
— i аа ЕНаЕ 

From the outcome of the tests in part iii one concludes that, for category 
a» drugs | and 3 differ from drug 2 but that there is no statistically signifi- 

cant difference (on the criterion used) between drugs 1 and 3. k 
Test for Homogeneity of Error Variance. Although F tests in the 
analysis of variance are robust with respect to the assumption of homo- 
geneity of error variance, a rough but simple check may be made = m 
assumption through use of the Fmax Statistic. Apart from the use of this 


statistic, the variances of the individual cells should be inspected for any 
kind of systematic pattern between treatments and variances. In a 
having a relatively large number of observations within each cell, within- 
cell distributions should also be inspected. | 

Use of the Fax test for homogeneity of varia 
data in part i of Table 6.2-3. For this purpose 
each of the cells is required. In the computat 
Table 6.2-3, the pooled within-cell variation from а 


The variation within cell ij has the form 
SS;; = X (Xin — AB; 
n 


nce will be illustrated for the 
the within-cell variation for 
ional procedures given in 
Il cells is computed. 


=®Х 
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For example, 


S$, = (8? + 4 + 0) — - 


= 80 — 48 = 32. 
Similarly, the variation within cell ab», is given by 
о» 


SS, = (142 + 10? + 6%) — 


= 332 — 300 =32. 
The other within-cell sums of squares are 
SS, —8,  SS,—8, SS =8,  SSy=18. 
As a check on the computational work, 
XSS,, = SSw.cen = 106. 


Since the number of observations in each cell is constant, the Ру statistic 
is given by 
SS(largest) 32 


F Tu eed ао 
mx $S(smallest) 8 


= 4.00. 

The critical value for a .05-level test is Ғ..(р4,п- 1), which in this case is 
Еь(6,2) = 266. Since the observed Fmax statistic does not exceed the 
critical value, the hypothesis of homogeneity of variance may be considered 
tenable. In cases in which the assumptions of homogeneity of variance 
cannot be considered tenable, a transformation on the scale of measurement 
may provide data which are amenable to the assumptions underlying the 
analysis model. 

Approximate F Tests When Cell Variances Are Heterogeneous. A 
procedure suggested by Box (1954, p. 300) may be adapted for use in testing 
simple effects for factor A, even though variances may be heterogeneous. 
The F ratio in this case has the form 


T MS, for b; A 
MSerror (,) 


where Мб оң) is the pooled within-cell variance for all cells at level Б. 
The approximate degrees of freedom for this F ratio are 


1 for numerator, 
п — 1 for denominator. 
In testing simple effects for factor B at level a; the F ratio has the form 


MS, гога 


Е- А 
MSerror(a;) 
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where Моца) is the pooled within-cell variance for all cells at level dy. 
The approximate degrees of freedom for this F ratio are (15 5:1): da 
the usual test (assuming homogeneity of variance) the degrees of freedom 
for the latter F ratio are [(q — 1), pq(n — 1)] if all cell variances are pooled 
and [(g — 1), q(n — 1)] if only variances from cells at level a; are pooled. 

Alternative Notation Systems. The notation system that has been 
adopted for use in this and following sections is not the most widely used 
system but rather a slight variation on what is essentially a common theme 
running through several notation systems. Bennett and Franklin (1954) 
use a closely related notation. The equivalence between the two systems 
is expressed in terms of the following relationships: 


Cin a EX. 
In terms of the Bennett and Franklin notation system, the sums of 
Squares have the following symmetric form: 
SS, = SS, = C, — С, 
SS, = SS, = C, — C, 
55 = 55, = С — C, — C + C, 
SSw. сеп = С, = Ci. 
The notation used by Kempthorne (1952) is perhaps the most widely 
used. In this system a single observation is designated Y,;,. 


2 
cr - S. 
пра 
Жү = 2A} 
БҮЗ XB 


Уү? = =(4B;,)*. 
ij 


6.3 p xq Factorial Experiment—Unequal Cell Frequencies 
Computational procedures for an unweighted-means analysis will be 
described іп this section. The conditions under which this kind of analysis 
is appropriate are given in Sec. 5.22. For illustrative purposes the com- 
putational procedures are cast in terms of a 2 x 4 factorial experiment; 
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these procedures may, however, be generalized to any p x q factorial 
experiment. Computational procedures for the least-squares solution are 
given in Sec. 6.14. 

Suppose that the levels of factor A represent two methods for calibrating 
dials and levels of factor B represent four levels of background illumina- 
tion. The criterion measure is an accuracy score fora series of trials. The 
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Table 6.3-1 Numerical Example 


----------------------- 


Observed data: 


by by bg | by 
(i) a, | 3,467 | 5,6,6,7,7 | 46,88  |8,10,10,7,11 
Gel) 253,4 3, 5, 6,3 9,12,12,8 |9,7, 12,11 
БРЕ ТЕ). es Sa 
Cell data 
by by bs by 
nj 4 5 4 5 
А 529% 20 31 26 46 
Gi) 1 xx? 110 195 180 434 
SS;; 10.00 2.80 11.00 10.80 
пу 3 4 4 4 
d xx 9 17 41 39 
2 xx? 29 79 433 395 
58; 2.00 6.75 12.75 14.75 
талады SAREE ЛЫ ЙЫ dI c 
M 25 + 20 + 25 + 20 + 33 + 25 + 25 + 25 
ti) = 4.04 


SS, con = 2985, = 10.00 + 2.80 + +++ + 14.75 = 70.85 


original experiment called for five observations per cell. However, because 
of conditions not related to the experimental variables, the completed 
experiment had three to five observations per cell. The observed criterion 
scores are given in part i of Table 6.3-1. Summary of within-cell informa- 
tion required in the analysis is given in part ii. The variation within cell 
abı is 

(20) 


55 = 110 — = 10.00. 


The harmonic mean of the cell fre ies 1 i ES 
} quencies is computed in part iii. Т 
computational formula used is P an hé 


п 


eu: A 
XX([nj) 
The pooled within-cell variation is also computed in part iii. 
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The data in the cells of part i of Table 6.3-2 are means of the respective 
n; observations іп the cells, All the computational symbols in part ii are 
based upon these means and row and column totals of these means. In 


Table 6.3-2 Numerical Example (Continued) 


Cell means: 
| by b, bs b, Total 


(i) a 5.00 6.20 6.50 9.20 26.90 
а 3.00 425 1025 9.75 2725 


8.00 10.45 16.75 18.95 54.15 
———————Є—ЄЄ—Є 


(1) = G?/pq = (54.15)/8 = 366.53 
(2) = EX? (see part ii, Table 6.3-1) 
Gi) (3) = (®42)/у = (26.908 + 27.25%)/4 = 366.54 
(4) = (ЕВФу/р = (8.00? + 10.45% + 16.75% + 18.95%)/2 = 406.43 
(5) = ХАВ)) = 5.00? + 620? + +++ + 9.75? = 417.52 
атыны ыссы 9S) we crop ee 
SS, = 9,103) — (ІЛ = 4.04[366.54 — 366.53] = .04 
(iii) SS, = Ó[(4) — (1)] = 4.04406.44 - 366.53] = 161.24 
SS, = (5) — (3) — 4) + MI = 44.72 


ee aS ы л ЕЕЕ 
defining the computational symbols in (ii), each of the cell means is con- 
sidered as if it were a single observation. Computational formulas for 


the main effects and interaction are given in part iii. 
The analysis of variance is summarized in Table 6.3-3. The degrees of 


Table 6.3-3 Summary of Analysis of Variance 


Source of variation SS df MS F 
A (method of calibration) .04 1 04 
В (background illumination) 161.20 3 53.73 18.99 
AB 4472 3 14.91 5.27 
Within cell 70.85 25 2.83 


—  —— M ee 


freedom for the within-cell variation are Xm — pq = 33 —8 = 25. If 
factors A and В are fixed, then М8, con is the proper denominator for all 
tests. By using the .05 level of significance, the critical value for the test on 
the interaction is F,,(3,25) = 2.99. Since the observed F ratio, Е = 5.27, 
is larger than the critical value, the data tend to contradict the hypothesis of 
zero interaction, The test on the main effect for factor B has the critical 
value 2.99, The observed F ratio, F = 18.99, is larger than the critical 
value for а .05-level test. Hence the data contradict the hypothesis that the 
main effects of factor B are zero. Inspection of profiles (Fig. 6.3) of the 
simple effects of В for levels a, and a; indicates why the effects of factor 
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A are masked by the interaction. For the first two levels of factor B the 
means for level a, are higher than the corresponding means for level a;, and 
for the other two levels the means for level a, are lower than the corre- 
sponding mean for level аҙ. Opposite algebraic signs of such differences 
tend to make their sum close to zero in the main effects for factor A. 

To illustrate the computation of the simple effects, the variation due to the 
simple effects of B for level a; is obtained as follows (data for the com- 
putations are obtained from part ii of Table 6.3-2): 


(Say) = 3.00? + 4.25? + 10.25? + 9,75? = 227.19 
(3a,) = (27.25)?/4 = 185.64 
SS, tor a, ЕТ п,((5а;) NS (3а)] = 167.86 


MS, for a, = SSo foray _ 55.95, 
q—1 


A test of the hypothesis that the variance of the simple effects of factor B 
at level а; is zero is given by the F ratio 


p М® юе _ 5595 


11 = 19.77. 


Ма, сеп: 283 
10 The degrees of freedom for this F ratio are 
(3,25). 
In comparing two means, the actual num- 
9 ber of observations upon which the mean is 
based may be used, For example, 
g АВ, — AB, 
VMSv. ceu[CL/m4) + (1/33)] 
7 9.20 — 5.00 4.20 
ў 42.3(86--3) 127 
The degrees of freedom for this t statistic are 
those for MS, «n. In making all possible 
5 tests between ordered means within a logical 
grouping, the procedures given in Sec. 6.2 
4 may be followed, assuming 7, observations 
per cell. 
aL 6.4 Effect of Scale of Measurement on 
Interaction 
2 In Sec. 5.4 it was indicated that inter- 


b bz bs b actions were in part a function of the choice 
Figure 6.3 Profiles of simple of the scale of measurement. When this 
effects for factor B. is the case, interaction effects may be 
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“removed” by proper choice of a scale. A numerical example will be used 
to illustrate this point. 

The data at the left in part i of Table 6.4-1 represent observed criterion 
scores obtained from a3 х 3 factorial experiment having two observations 
percell. The symbols in part ii are defined in Table 6.2-1. The analysis of 
variance for these data is summarized in part iii. The critical value for a 
Ol-level test on the interaction is Fo9(4,9) = 6.42. Since the observed F 


Table 6.4-1 Analysis of Variance in Terms of Original Scale 
of Measurement 


Observed data AB summary table 
b, by by ТЫ тоа 
(y a| 10 12,14 20,27 а 1 26 47 74 
| 9,9 32,30 40, 55 d» 18 62 95) 175 
аз| 30,34 64,70 | 100,96 аз 64 134 196 394 
Total| 83 222 338 643 
(ii) (1) = 22,969.39 (3) = 31,889.50 (5) = 38,273.50 
(2) = 38,449 (4) = 28,402.83 
Source of variation 55 df MS F 
A 8,920.11 2 4460.06 
(ш) В 5,433.44 2 2716.72 
AB 950.56 4 237.64 12.19 
Within cell 175.50 g 19.50 
Total 15,479.61 17 


ratio exceeds the critical value, the experimental data tend to contradict 
the hypothesis that the interaction effects are zero. The profiles of factor 
B for each level of factor A are shown in Fig. 6.4. Within the range of the 
data, the profiles do not cross. Further, each profile is approximately 
linear inform. The major difference in these profiles is in the slopes of the 
respective best-fitting lines. 1 

A square-root transformation on the original scale of measurement will 
make the slopes in this kind of configuration approximately equal. From 
the AB summary table, the ranges of the respective rows, in terms ofa 
square-root transformation, are as follows: 


Row a: / pe MTS 6 
Row ay: J 95 — 4/18 = 6 
Row аз: 1196 — 64 = 6 


The fact that these ranges are approximately equal provides partial evidence 
that the square-root transformation, when applied to the original observa- 
tions, will yield profiles having approximately equal slopes. 
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180 


150 


1 2 b, 


Figure 6.4 Profiles for factor B at levels of factor A, (Original 
scale of measurement.) 


Table 6.4-2 Analysis of Variance in Terms of Transformed Scale of 
Measurement 


Observed data (transformed scale): 


by by by Ы 8 by "Total 
© 4, 1.0,0.0 | 3.5,3.7| 45,52 a, ДЛО 72:0 97 1717,9 
aa | 3.0, 3.0 | 5.7, 5.5 | 63,74 4, | 60 112 13.7 30.9 
аз | 5.5, 5.8 | 8.0, 8.4 | 10.0, 9.8 аҙ | 11.3 164 198 475 
Total | 183 348 432 963 
(ii) (1) = 515.20 (3) — 588.58 (5) — 642.38 
(2) — 643.91 (4) — 568.70 
Source of variation SS df MS E 
A 73.38 2 366 | É 
3 53.50 2 26.75 
(iii) AB 30 4 075 | F<1 
Within cell 1.53 9 A70 
Total 12871 | 17 
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Interms of the transformed scale of measurement, the observed data 
are given in part i of Table 6.4-2. The transformation has the following 
form. 

in == К Ху. 
Each entry in the table at the left of part i is the square root of the corre- 
sponding entry in part i of Table 6.4-1. The analysis of variance for the 
transformed data is summarized in part iii of Table 6.4-2. It will be noted 
that the F ratio in the test on the AB interaction is less than unity in this 
case. In contrast, the F ratio for the analysis in terms of the original scale 
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18 


15 


12 


ay 


0 es 


1 b, b, 


= 


Figure 6.5 Profiles for factor В at levels of factor A. (Trans- 
formed scale of measurement.) 


was 12.19. Тһе profiles in terms of the transformed scale of measurement 
are shown in Fig. 6.5. The magnitude of the mean squares for the main 
effects, relative to the within-cell mean square, is approximately constant 
for both scales of measurement. 

Not all interaction effects can be regarded as functions of the scale of 
Measurement. In cases where profiles cross, or in cases where the profiles 
have quite different shapes, transformations on the scale of measurement 
will not remove interaction effects. If, however, interaction effects can be 
removed by transformations, there are many advantages in working with a 
model which contains no interaction terms. This is particularly true in the 
mixed model, which has both fixed and random variables, since interactions 
between fixed and random factors form denominators for F ratios.. 


248 COMPUTATIONAL PROCEDURES AND NUMERICAL EXAMPLES 


65 p xq xr Factorial Experiment Having n Observations per Cell 


The notation and computational procedures developed in Sec. 6.2 may 
be extended to three-factor experiments as well as higher-order factorial 
experiments. In this section the extension will be made to a p xqxr 
factorial experiment. It will be assumed that there are n observations in 
each cell. Notation will be indicated for the special case of a 2 x 3 x 2 
factorial experiment. There are pqr = 2(3)(2) = 12 treatment combina- 
tions in this experiment. The notation for the treatment combinations is 
illustrated in the following table: 


сі | СЯ 


ue ТР ЕЕ POET TS 


а абси abe абез | абс» абс, абез 
98 афсу асу abc» | abt абс» абс» 
A typical treatment combination in this experiment is designated by the 


notation абс, , where i indicates the level of factor A, j the level of factor В, 
and k the level of factor С. 


The п observations under treatment combination abc; are represented as 
follows: 


(Хан Xim c Жалт 07” Хы 


Thus the notation Х, ur» denotes an observation on element m under treat- 
ment combination abc;,. For the general case, 


ies. a езі E 2 eqs k1,2,...,7; 
et Le E 


The sum of the n observations under treatment combination abc,,, will be 
designated by the symbol АВС. Thus, 


АВС,» = DX iim: 

A table of such sums will Бе called an ABC summary table. For the case of 
a2 x 3 x 2 factorial experiment, the ABC summary table has the follow- 
ing form: 


e Cy 


by by ba b, b; bs Total 


а | АВС АВС АВС ABCu; АВС») АВС А 
4; | АВС — ABC. АВС АВСыз АВС» АВС» Ay 
Total! ВС, ВС, BCA BOs ВС B G 


The column totals in this ABC summary table have the general form 
ВС, = ХАВС;,, жы УУХ. 
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That is, a column total represents the sum of all observations under treat- 
ment combination Әс, the levels of factor A being disregarded. The BC 
summary table has the following form: 


b, bs bs Total 
Сі BCy ВС, BCs, C 
C2 ВС,» ВС» BC С; 
Total В, В, Bs G 


Treatment combination ab; is defined to be the collection of treatment 
combinations айс, abC;s, ..., асу. Тһе sum of all observations at 
level ab;, is the sum of all observations in this collection. The sum of all 
observations at level ab;; is thus 


AB; = УАВС, „л. Б УР Хы, 
For the case being considered, 5 as 

АВ, = АВС + АВС, 

АВ» = АВС, + ABC: 
The АВ summary table has the following form: 


| ы % bs Total 
а ABy ABis АВІз А 
а AB» АВ» АВ» А 
Total By Bs В, G 


The symbol АС, will be used to designate the sum of all observations at 

level ас. Thu 
2 АС, = АВС,» = УУХ 
3 іт н 
For example, the treatment combinations at level асу; for the case being 
considered аге abc, aby, and а Сұ». Thus, 
АС = АВС, + АВС,» + АВС). 

The AC summary table has the following form: 


сі [o^ Total 
а, ACy АС,» 4 
а» АС АС» A» 
Total С, с, б; 


The sum of all observations at level a, may be obtained as follows: 
A= УАВ, i» ХАС, FE ХХАВС, йл > Ха» 
j k k ikm 


j j 
The sum of all observations at level b; is given by 


B; = ŽAB; = УВС Е ХХУАВС, =y УУХХ 
і k k 


Similarly, i ikm 
C, = АС = >ВС = XXABC; E УУХХ 
i 3 7 tim 


i 
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To summarize, two-way summary tables are most readily obtained from 
three-way summary tables by combining levels of one of the factors. Thus 
the BC summary table is obtained from the ABC summary table by adding 
totals in the latter table which are at the same levels of factors B and C but 
at different levels of factor A. The AB summary table is obtained from the 
ABC summary table by adding totals in the latter table which are at the same 
levels of factors A and B but at different levels of factor С. Тһе AC sum- 
mary table is obtained in an analogous manner. 


Table 6.5-1 Definition of Computational Symbols 


(1) = G?/npqr (6) = [2(4B,,)*]/nr 

= ХХ! Зат (7) = ECAC) lng 

(i) (3) = (247) /ngr (8) = [X(8Cj?]/np 
(4) = (ЕВ?)/прг (9) = (АВС, )Ч/п 


(5) = (“С )/прд 
SS, = тғХА, — б)? = (3) — (1) 
SS, = nprX(B, — Gy = (4) — (1) 
SS, = npqX(C, — бу = (5) — (1) 
SS, = nrX(4B,, — А, — B, + С)? 
= nr*(AB,, — б) — 55, — SS, = (6) — (3) — (4) + (1) 
Gi) SS, -тХАС, — А, — C, + Gy 
= (АС — б) — SS, — SS, = (7) — (3) — (5) + (1) 
58, = npX(BCj, — B, — C, + Gy 
= npX(BCj, — б) — SS, — SS, = (8) — (4) — (5) + (1) 
55,5, = nY(ABC,, — АВ) — АС — ВС, + Ay + B, + C, — бу 
= nX(ABC;j, — G)? — SS,, — SS4, — SS, — SS, — SS, — SS, 
= (9) — (6) — (7) — (8) + (3) + (4) + (5) — (1) 
55у, cot = XU. — АВС) = (2) — (9) 
SStotat = X, — G)? — (2) — (1) 


Symbols in terms of which computational formulas may be conveniently 
written are summarized in part i of Table 6.5-1. With the exception of 
symbol (2), all are obtained from either the two-way or the three-way sum- 
mary tables. In each case the divisor for a computational symbol is the 
number of basic observations summed to obtain a term which is squared in 
the numerator, For example, in computational symbol (3), A, is squared. 
There are ngr observations summed to obtain A,. 

There is a relatively simple method of determining the number of basic 
observations summed to obtain a total of the form А, Inapxqxr 
factorial experiment a basic observation is Tepresented by the symbol 
Хат. In A, the subscripts j, К, and m are missing. The numbers of levels 
corresponding to these missing subscripts are, Tespectively,q,r,andn. Тһе 
number of observations summed to obtain A, is the product of these missing 
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subscripts,qrm. In the total B;, the subscripts i, k, and m are missing. The 
corresponding numbers of levels are р, г, and n; hence the number of obser- 
vations summed is prn. In the total BC;, the subscripts i and m аге missing. 
Hence pn observations are summed'in this total. In the total AC the 
subscripts jand mare missing. Hence the number of observations summed 
in this total is qn. 

Where the index of summation does not appear under the summation 
symbol, it is understood that the summation is over all possible terms of the 
form specified. For example, the notation X(4B,;)* indicates that the sum 
is over all the pq cell totals in a two-way summary table of the form AB;;. 
Similarly, the notation X(BC,,)? indicates that the sum is over all the gr cell 
totals having the form ВС. The basic definitions of the estimates of 
variation due to main effects and interactions are summarized in part ii of 
Table 6.5-1. Corresponding computational formulas are also given. 

Ina three-factor factorial experiment there are various orders of simple 
effects. Computational formulas for these sources of variation may be 
obtained by specializing the symbols given in part i of Table 6.5-1. The 
symbol (ба;) is defined to be the equivalent of symbol (6), in which the sum- 
mation is restricted to level a;. For example, 


(бау — (AB AB? +++ + CABU 
1 , 
nr 
(ва) — (AB? + (AB) -- + (AB) 
1 nr 


Similarly the symbol (7c;) is defined to be computational symbol (7), їп 
which the summation is limited to level c,—that is, the summation is re- 
stricted to row с, of the BC summary table. М5). 

The computational symbol (9a;) restricts the summation in (9) to row a, 
of the ABC summary table. Computational symbol (9аь,) restricts the 
summation in (9) to just those totals in which the factor A is at level a, and 
factor B is at level bj. For example, 


Qu, _ ABO + (ABO? ABC, 
MT. 


n 
(оа) = (ABC + (АВСьы + +++ + (АВС) 
237 = 
n 


In terms of computational symbols defined in this manner, computational 
formulas for various orders of simple effects are given in Table 6.5-2. 
The following relationships hold for the computational symbols: 
(6) = (ба) + (баз) + "+ (625). 
(7) = (Тау) + (Taz) + 2-00), 
(9) = (941) + 92) + °° + @а„), 
(9) = XXQab;j. 
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Analogous relations hold for other computational symbols. 

Numerical Example. The computational procedures will be illustrated 
by means ofa2 x 3 х 2 factorial experiment. The purpose of this experi- 
ment is to evaluate the relative effectiveness of three methods of training 


Table 6.5-2 Computational Formulas for Simple Effects 


Effects Sum of squares 


Simple interactions: 


AB for level c; (9c,) = (7с) — (8с) + (Sey) 

АС for level b; (9bj) — (6b;) - (8b;) + (4b;) 

BC for level a; (9а) — (ба) - (7а) + (За) 
Simple main effects: 

A for level by (6b;) - (45) 

A for level сь (Tex) = (Sey) 

B for level a, (ба) — (3a;) 

B for level сь (Вс) — (Sey) 

С for level а, (Та) = (За) 

С for level b; (8b;) - (45) 
Simple, simple main effects: 

A for level bes, (9bcj,) — (8Ьсу) 

В for level ас; (9ac;,) - (Тас) 

С for level ab;; (9ab,) - (ба) 


Ж _—_——————————————= ŘĖÁI 
Computational checks: 


Xss, fore, = SS, + 55, 
ESS op for “ 7 SSan + $$ ье 
ESS, for bey = SS, + $$ ь + $$ + 55,, 
— тты 


(factor B). Two instructors (factor C) are used in the experiment; subjects 
in the experiment are classified on the basis of educational background 
(factor 4). The plan for this experiment may be represented as follows: 


Instructor: | сі | “4 


Training method:| b, b, ji b, by by 


Educational a | бш Gin Gin | Guz біз Gis 
level а, | Gu бшщ Gm | быз б Gage 


In this plan бі represents a group of subjects at educational level а, 
assigned to instructor c, to be trained under method bı. The symbol бұ» 
denotes the group of subjects at educational level a, assigned to instructor ca 
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to be trained under method b. Thus each instructor teaches groups from 
both educational levels under each of the training methods. It will be 
assumed that there are 10 subjects in each of the groups, a total of 120 sub- 
jects in all. 

In this experiment the methods of training (factor B) and the levels of 
education (factor A) will be considered fixed factors. Factor A is a classifi- 
cation variable included in the experiment to control potential variability in 
the experimental units, which is a function of level of education. (АП 
relevant levels of the education factor must be covered if this factor is fixed.) 
Factor B is the treatment variable of primary interest; this factor is directly 


Table 6.5-3 Expected Values of Mean Squares for Numerical Example 
(A, B fixed; C random) 


Effect i j k m Expected value of mean square 
J ре 5 is 
a 0 3 2 10 о? + 3002, + 6002 
В, 2 0 2 10 о? + 200%, + 400} 
Ук 2 3 1 10 о? + 600% 
ор; 0 0 2 10 0? + 200°; 
ву 0 3 1 10 o? + 300°, 
Вуд 2 0 1 10 a + 2003, 
ШАЛ 0 0 1 10 o? + 1003, 
Етік) 1 1 1 1 a 
се! a ———--— 


under the control of the experimenter. There is some question about 
whether or not factor C should be considered a fixed variable. If it is the 
purpose of the experiment to draw inferences about the methods of training 
which potentially hold for a population of instructors, of which the two 
instructors in the experiment can be considered a random sample, then the 
instructor factor is random. If inferences about the methods are to be 
limited to the two instructors in the experiment, then the instructor factor is 
fixed. Often in this type of experiment, inferences are desired about the 
methods over a population of specified instructors. Hence the instructor 
factor should be considered as a random factor, and suitable randomization 
procedures are required in the selection of the instructors. . ) 
The expected values for the mean squares for the case under consideration 
are given іп Table 6.5-3. Тһе model from which these expected values were 
obtained includes interaction terms with the instructor factor. According 
to these expected values, the test on the main effect of factor B has the form 


MS, 
MS, 


This F ratio has degrees of freedom [4 — 1), (q — Dr. — Dl. which in this 
case is (2,2). When the denominator of an F ratio has only two degrees of 


Ет 
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Table 6.5-4 Data for Numerical Example 


ЕЕЕ 


ABC summary table AB summary table 
с % | 
b, ba bg | b, b, bs | Total b, 5 b, Total 
a| 20 30 12 | 16 33 8| 119 a | 36 63 20 119 
а,| 36 38 40 40 44 42 240 а, | 76 82 82 240 
56 68 52 | 56 77 50 | 359 112 145 102 359 
(i) BC summary table AC summary table 
b by b, Total су c; Total 
С КС50Е 2 5877752-77176 2141596259557 119 
а; лы Sg 1183 аъ | 114 126 240 
11244 145- 102 359 176 83 359 
(1) = (359°)/120 = 1074.01 
(2) = (not available from above data) = 1360 
(3) = (119° + 240°)/60 = 1196.02 
4 (4) = (112? + 145° + 102*)/40 = 1099.32 
(ii) (5) = (176° + 183°)/60 = 1074.42 
(6) = (36° + 63* + 20° + 76° + 822 + 822)/20 = 1244.45 
(7) = (62° + 57° + 114° + 126°)/30 = 1198.83 


(8) = (56° + 68° + 52° + 56° + 77° + 50%)/20 = 1101.45 


(9) = (20° + 302 + --- + 44° + 422/10 = 1249.30 
SS, = (3) - (1) = 122.01 SS,, = (6) - (3) — (4) + (1) = 23.12 
SS, = (4) — (1) = 25.31 55,, = (7) — (3) - (5) + (1) = 240 


(іі) 55, = (5) – (1) = 041 SSe = (8) — (4) — (5) + (1) = 1.72 
55е = (9) — (6) - (7) — (8) + (3) + (4) + (5) — (1) = 0.32 
55у. сеп = (9) — (2) = 110.70 
SStotar = (2) — (1) = 285.99 
еер ы Ee Aem at angl i 


freedom, the power of the resulting test is extremely low. This F ratio does 
not provide a sufficiently powerful test of the main effects of factor B to be 
of much practical use. 

If, however, it can be assumed that interactions with the random factor 
(C) are negligible relative to the other uncontrolled sources of variation 
which are included in the experimental error, then interactions with factor C 
may be dropped from the original model. In terms of a model which does 
not include such interactions, relatively powerful tests on factor B are avail- 
able. Inspection of Table 6.5-3 indicates that MS, MS,, М5,,, and 
MS,, сеп are all estimates of variance due to experimental error if inter- 
actions with factor C are not included in the model. Preliminary tests on 


the model may be made to check on whether or not such interactions 
may be dropped. 


—=_ —M 
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Suppose that the 4BC summary table for the data obtained in the experi- 
ment is that given in part i of Table 6.5-4. Each of the entries in this table 
is the sum of the 10 criterion scores for the corresponding group of subjects. 
For example, the entry in cell асу» is the sum of the 10 criterion scores for 
the subjects in group Gigs, that is, the subjects at education level a, trained 
under method 4, by instructor с». The two-way summary tables given in 
part i are obtained from the three-way summary table. For example, the 
entry ab; in the AB summary table is given by 


АВ, = АВС + АВС,» 
= 20 + 16 -36 
The computational symbols defined in part i of Table 6.5-1 are obtained 
Table 6.5-5 Summary of Analysis of Variance 


Source of variation 55 | df MS 
A (level of education) 122.01 pen | 1201 
B (methods of training) 25.31 4-1-2 12.66 
C (instructors) 0.41 r-1=1 0.41 
AB 23.12 (p-1Xq—1) =2 11.56 
AC 2.40 (р = Wr =) =1 2.40 
ВС 172 (9 = De —1)=2 0.86 
АВС 032 | (р= 09 – 00—10) =2 0.16 
Within cell 

(experimental error) | 110.70 | рфг(п - 1) = 108 1.02 

Total | 28599 | 119 


ee оа афза Е ала аа 


іп part ii іп Table 6.5-4. Data for all these computations, with the excep- 
tion of symbol (2), are contained in part i. Symbol (2) is obtained from the 
individual criterion scores; the latter are not given in this table. The 
computation of the sums of squares is completed in part iii. i 

The analysis of variance is summarized in Table 6.5-5. Preliminary tests 
on the model will be made on the interactions with factor C (instructors) 
before proceeding with other tests. According to the expected values of the 
mean squares given in Table 6.5-3, tests on interactions with factor C all 
have MS, «n as a denominator. These tests have the following form: 


ПТ М& Е 102 


Only the F ratio for the test on the AC interaction is greater than unity. 
By use of the .10 level of significance, the critical value for the latter test is 


Е»)2,108) = 2.76. 
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The outcome of these preliminary tests on the model does not contradict 
the hypothesis that interactions with factor C may be considered negligible. 
(The AC interaction is a borderline case.) On a priori grounds, if the 
instructors are carefully trained, interaction effects with instructors may 
often be kept relatively small. On these bases, the decision is made to drop 
interactions with factor C from the model. The expected values corre- 
sponding to this revised model are obtained from Table 6.4-3 by dropping 
the terms o3,, o2,, and озу, from the expected values in this table. Since 
the degrees of freedom for the within-cell variation (108) are large relative to 
the degrees of freedom for the interactions with factor C, which total 5, 
pooling these interactions with the within-cell variation will not appreciably 
affect the magnitude of М5, en or the degrees of freedom for the resulting 
pooled error term. Hence М5, cen is used under the revised model for 
final tests on factors A and B. These tests have the following form: 


MES рз E 525452. 
M w. cell 
MS, 
=——— = 12.41, F 99(2, 108) = 4.82; 
М5,, cell - 
MS, 
ELE 19:62) F g9(1, 108) = 6.90. 
М$у, сеп a ) 


Because of the significant AB interaction, care must be taken іп inter- 
preting the main effects due to factor 8. The manner in which educational 
level is related to method of training is most readily shown by the profiles of 
the simple effects for the methods at each of the levels of education. These 
profiles are drawn in Fig. 6.6. Data for these profiles are obtained from 
the AB summary table. Inspection of these profiles indicates that differ- 
ences between the methods of training for groups atlevela, are not somarked 
as the corresponding differences for groups at level а). 

Variation due to differences between training methods for groups at level 
а, is given by (data are obtained from row a, of the AB summary table) 


36° + 63° +20? 119° 
SS at 8, — 
20 60 


This source of variation has q — 1 = 2 degrees of freedom. Hence the 
mean square of the simple effect of factor B for level a, is 


MS, tora, = an = 23.62. 


To test the hypothesis of no difference between the methods of training for 
groups at educational level a,, 


MS 23.62 
= a 803.15. 
М5», сеп 1.02 


47.23. 


F 
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The degrees of freedom for this F ratio are (2,108). The data clearly 
indicate a significant difference between the methods for level aj. The test 
on the simple effects of methods for level a, indicates no significant difference 
between the methods; the F for this test will be found to be F — 0.60/1.02. 

The levels of factor C (instructors) in this design may be considered to be 
replications. As indicated by the expected values for the mean squares, the 
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Figure 6.6 Profiles of simple main effects for training methods. 


proper denominator for tests on fixed effects is the corresponding interaction 
with replications, provided that such interactions cannot be pooled. When 
tests are made by usinga pooled error term (this is essentially what has been 
done in the example that has just been considered), it is implicitly assumed 
that interactions with replications (instructors) do not exist. —- р 

There аге potentially many reasons for the presence of interaction in this 
kind of experiment. Inspection ofthe A x Bsummary table indicates that 
groups at level a, are uniformly good under all the training methods. In 
part, this may be a function of a ceiling effect on the criterion. | If the latter 
is the case, then the interaction is an artifact of the way in which perform- 
ance is measured. Care is required in constructing the criterion to avoid 
this kind of artifact. 
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Alternative Computational Procedures for Three-factor Interactions. In 
Sec. 5.10 the relationship between two-factor and three-factor interactions 
was indicated. Fora2 x 3 x 2 factorial experiment, 


SSave = SSab ate, + SSabate, — 85а. 


In general, a three-factor interaction will be zero whenever the sum of the 
simple two-factor interactions is equal to corresponding over-all two-factor 
interaction. Thus, the three-factor interaction is a measure of the additiv- 
ity of simple two-factor interactions. 

The relationship given above provides a method for computing the three- 
factor interaction. For the data in Table 6.4-4, one may compute the 
following computational symbols: 


2 21 122 1 362 1 392 2 
(ва) — 207+ 308 + 12 + з6ё + 38+ 40" srg 4g 
2 4/432 2 | 4n? 24 402 
Oop EIB EM + ме Rr оу 
10 
2 2 2 
(6045456 068 2-53) 
20 
562 + 772 + 502 
йг lla 24-2 2” 
(8) 20 ? 
2 2 
Ее 
30 
57° + 126° 
0271126 63750, 
( Cy) 30 
176° 183° 
Ge =з1627, “(=з 
1 60 (5с) "m 558.15. 


The variation due to the simple AB interaction for level c, is 
(96) — (7с) — (86) + (5с) = 10.14. 
The corresponding variation for level Съ is 
(9с;) — (7e3) — (8е;) + (5e) = 13.30. 


The variation due to the over-all AB interaction was found to be 23.12. 
Hence the variation due to ABC is 


SS,,, = 10.14 + 13.30 — 23.12 = 0,32, 
6.6 Computational Procedures for Nested Factors 


, Factorial designs in which one or more factors are nested were discussed 
in Sec. 5.12. Computational procedures in which one factor is nested 
under a second will be illustrated from the case of a p x q x r factorial 
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experiment having п observations in each cell. Assume that factor В is 
nested under factor A. The analysis of variance for this case generally 
takes the following form: 


Source of variation df 

A peal 

B within A pq —1) 

С r —1 

AC (р = 09 —0 
(B within А) x С р — yr – 1) 
Within cell рға - 1) 


------------------ 


The variation due (о В within A is defined to Бе 
55м) = пгУУ(АВ,; — Aj. 
із 
This source of variation is actually а sum of simple main effects of factor 


Bateach level of factor А. In terms of the computational symbols defined 
in Table 6.5-1, 


55м) = (6) — (3). 
For a design in which factor B is not nested under factor A, 
df 
SS, = (4) — (1) 4-1 
55 = (6) — G) — 4) +0) (p D =!) 
Sum (6) — (3) ра = 1) 


Thus SS, in a design in which factor B is nested under factor 4 is numeri- 
cally equal to SS, + SS,, in the corresponding factorial design in which 
factor B is not nested under factor A. ш А 
The variation due to the interaction (B within A) x C is defined to be 
55а =E У У (АВС, ж. AB; = АС, — 4)? 
This source of variation is actually a sum of simple interactions. In terms 
of the computational symbols defined in Table 6.5-1, 
SS, = 9) — ©) —M + 0). 
For a p x q х r factorial experiment in which there is no nested factor, 
df 


55, = (8) — (4) — (5) + (1) q — Vr D 
55. = (9) — (6) — (7) — (8) + (3) + (0 + (5) -W {уел БЫШЫ 
Som 0 — (6) — () + @) ра — De — 1) 
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Thus 55,4): is numerically equal to SS,, + SS,,.. In general, if factor V 
is nested under factor U, 


55, = SS, + SSuvw- 
In а four-factor experiment in which factor C is nested under both factors 
A and B, 
55, = SS, + SS, + 55,, + 55,,,, 
SSc(avya = SS, + 55,4 + 55,4 + $$. 
Returning to a three-factor factorial experiment, consider the case of a 


2 x 2 x 3 factorial experiment having five observations per cell. Assume 
that factor B is nested under factor A and that factor B is a random factor. 


Table 6.6-1 Expected Values of Mean Squares 
(Factor B nested under factor A; A and C fixed, B random) 


Effect i J k m E(MS) 

" 0 © 28 sha hs 02 + 150% 4) + 3002 
Buy E ecd con 50 S 

Yk 2 2 0 5 в + 56%), + 2002 
«ук 0 2 0 5 о + 50505, + 1002, 
Вук 1 1 0 5 9; + 56%), 

етік 1 1 1 1 өз 


" 


—— — a PES A 
Assume also that factors A and C are fixed factors. Under these assump- 
tions, the expected values of the mean squares for this design are given in 


Table 6.6-1. From these expected values it will be noted that the test on the 
main effects due to factor A has the form 


ки MS via) 
If this denominator has relatively few degrees of freedom, the power of the 
test will be low. Preliminary tests on the model are often called for in this 
situation. 

Numerical Example. A 2 x 2 x 3 factorial experiment having five 
observations per cell will be used to illustrate thecomputational procedures. 


To make this example concrete, suppose that the experiment has the follow- 
ing form: 


Drugs: а аз 

Hospitals: bia) (а) b(a) b(a) 

Category of 

patients: Cy 0% Cg |Cy са 0% | Сі (6а e Гер 41 es 
т 5а 19245 ЕЗУ renee) ЕЕ Б 
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Suppose that the purpose of this experiment is to test the relative effective- 
ness of two drugs on patients in different diagnostic categories. Patients 
from hospitals Б, and bq, are given drug а,; patients from hospitals 
by, and bs, are given drug as. Since only one of the drugs under study 


isadministered within a hospital, the hospital factor is nested under the drug 
factor. The diagnostic categories are considered to be comparable across 


Table 6.6-2 Numerical Example 


нм UE nes ee Ed 


ABC summary table: 
ау а 
Total 
bita) bara) bitap) bora) 
сі 15 18 30 35 98 
e TA 25 245 21 92 
сз 38 41 10 14 103 
Total| 75 84 | 64 70 | 293 
@ AC summary table: 
| a аз Total 
Elo 2. 65 98 
ema) 45 92 
yall go 24-1 1035 
Total | 159 134 293 
(1) = (293)?/60 = 1430.82 
(2) (not obtained from above summary tables) = 1690 
3 (3) = (159? + 1342)/30 — 1441.23 
(ii) (5) = (982 + 922 + 103)*/20 - 1433.85 
(6) = (75% + 842 + 642 + 70°)/15 = 1445.13 
(7) = (332 + 47 ++ 45? + 24?)/10 — 1636.50 
(9) 55 (155: 222 +- + 21 + 142)/5 = 1644.20 


all hospitals; hence factor С is not nested. Only a random sample of the 
population of hospitals about which inferences are to be drawn is included 
in the experiment; hence factor B is random. СЕРУ | 

Suppose that the АВ summary table for the data obtained in this experi- 
ment is that in part i of Table 6.6-2. With the exception of symbol (2), 
which is computed from the individual observations, data for the com- 
putations are given in part i. (The definitions of these symbols appear in 
Table 6.5-1.) Symbols (4) and (8) are not required for this case. The 
analysis of variance is summarized in Table 6.6-3. 

The expected values of the mean squares are given in Table 6.6-1. | Before 
testing the main effects and interaction of the fixed factors, preliminary 
tests on the model are made with respect to factor B and its interaction with 
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factor C. These preliminary tests will be made at the .10 level of signifi- 
cance. Inspection of the expected values of the mean squares indicates 
that the preliminary tests have the following form. 

MS, 
MS, сеп 


F= = 1.37, Е.,(3,48) = 2.22; 


Е = Ме < 100, (6,48) 192. 
MSy. сеп 


Table 6.6-3 Summary of Analysis of Variance 
=—————————-———————. 


SS, = (3) — (1) = 1041 

85) = (6) — (3) = 390 

(i) SS, = (5) - (1) = 3.03 
58, = (7) — (3) — (5) + (1) = 19224 

55мауе = (9) — (6) — (7) + (3) = 3.80 

55, cen = (2) — (9) = 45.80 


ee 


Source of variation SS df MS 

A (drugs) 10.41 1 10.41 

B (hospitals within A) 3.90 3 1.30 

(ii) C (categories) 3.03 2 1.52 
AC 192.24 2 96.12 

B(A) x C 3.80 6 0.63 

Within cell 45.80 48 0.95 

Pooled error 53.50 57 0.94 


Neither of the F ratios exceeds specified critical values. Hence variation 
due to B(A) and B(A) х Cis pooled with the within-cell variation. Thus, 


SSpootea error = 55,4) F 55,0 ЗЕ SS, сеп. 
The degrees of freedom for this term are the sum of the respective degrees 


of freedom for the parts. 


The denominator for all final tests is MS 


pooled errore For this case the 
final tests are 


MS, 
= £ = 102.26, Е 99(2,57) = 5.00; 
MSpoolea error Гу ) 
MS, 
- E — 11.07, F (1,57) = 710; 
MSpootea error aol ) 
MS 
FL———- 1.62, F (2,57) = 5.00. 
MSpootea error Гу ) 


In spite of the significant АС interaction, the main effect for factor A is 
significant. An analysis of the simple effects is required for an adequate 
interpretation of the effects of the drugs. Inspection of the AC summary 
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table indicates that drug a, has the higher criterion total for category с; 
there appears to be little difference between the criterion scores for category 
сы; drug a, has the higher criterion total for category сз. Formal tests on 
these last statements have the following form: 


(АС — АС)? (33 — 65) 


Е, 54.47, 
2ngMSpootederror 2(5)(2)(0.94) 

р (АСв— AC (07—45* 0) 
2nqMSpooiea error 2(5)(2)(0.94) 
2 LE 2 

p — (ACs — AC») _ (79 = 24) _ 160 99, 


2ngMSpootederror 2(5)(2)(0.94) 


6.7 Factorial Experiment with a Single Control Group 


A design closely related to one reported by Levison and Zeigler (1959) will 
be used to illustrate the material that will be discussed in this section. The 
purpose of this experiment is to test the effect of amount and time of irradia- 
tion upon subsequent learning ability. Different dosages (factor B) of 
irradiation are administered at different age levels (factor A). When all 
subjects reach a specified age, they are given a series of learning tasks. 
Separate analyses are made of the criterion scores for each task. The 
plan for the experiment may be represented as follows: 


Dosage of irradiation 


bi be 

a Gy Gi 

Age at which а» Go б» 
irradiation is аз Ga Сз» 
administered |а, Gu Gaz 


Suppose that each of the above groups contains 10 subjects. In addition to 
these eight groups, there is a control group Go, having 20 subjects, which 
receives no irradiation treatment. 

Thus there are nine groups in all, eight groups in the cells of the 4 x 2 
factorial experiment plus a control group. For the general case there will be 
pq + 1 groups. The analysis of variance for this experimental plan may 
take the following form: 


Between groups (pq + 1)-1= 8 
Control ys. all others 1 
A (age) p-123 
B (dosage) Ф=1 = 1 
АВ (р= 04-1) = 3 
Within cell pain — 1) + (по — 1) = 91 
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The one degree of freedom for control ys. all others contrasts the control 
group with all the experimental groups combined. It is also of interest to 
contrast the control group with each experimental group. The pro- 
cedure described in Sec. 3.10 may be adapted for this purpose. In this case 
the г statistic has the form 
~ С, em АВ; 

VMSw. а/ш) + (1/n)] 


where С, represents the mean of the control group. Critical values for this 
t statistic are given in Table B.6. Тһе degrees of freedom for this statistic 
are those of MS, «n; k corresponds to the total number of groups, which 
in this case is pg + 1. 

Tn the analysis of variance for the factorial part of the experiment the con- 
trol group is disregarded. An alternative analysis considers the control 
condition as defining an added level for one factor. That is, the plan 
is considered to be a p х (q + 1) factorial experiment. In the latter case, 
the plan may be represented as follows: 


t 


Dosage of irradiation 


by ы b, 

1 а | Gy Giz 

Age at which а, С, G5 Сб» 
irradiation is аз : Сз Сз» 
administered а; | 41 бі» 


Since group G, received no irradiation, it cannot be classified along the age 
dimension. From one point of view, however, G, may be considered as 
belonging to all the age classifications. The author does not agree, in 
general, with this latter point of view, but if it makes good experimental 
sense to adopt this point of view, then the data from group G, are used in all 
cells under by. If this is done, the analysis of variance has the following 


form: 
eee M 
A (age) p—l- 3 
B (dosage) 4-2 
АВ қ р9 = 1) = 6 
Within cell рдп — 1) + (ny — 1) — (р — 1) =88 


Since the data from G, are used in p different cells of the experiment, the data 
within each cell are not Statistically independent. To compensate, in part, 
for this lack of statistical independence, p — 1 degrees of freedom are sub- 
tracted from the estimate of experimental error. 

Numerical Example. Suppose data obtained from the eight experi- 
mental groups in the experiment which has just been described are those 
given in part i of Table 6.7-1. Each entry in the AB summary table is the 
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sum of 10 observations. The computational symbols in part i are defined 
in Table 6.5-1. [Data for the computation of symbol (2) are not given.] 
Summary data for the control group are given in partii. It is noted that the 
number of subjects in the control group is 20, whereas the number of sub- 
jects in each of the experimental groups is 10. 


Table 6.7-1 Numerical Example 
_——————— 


AB summary table (л = 10): 


| by by Total 
(1) = 143,143 
a, | 380 310 60 (2) = 148,129 
(i) а» 405 340 745 (3) = 146,559 
a, | 5485 57410: 7 %055 (4) - 143,479 
a 504 490 994 (5) = 147,037 
1774 1610 3384 
Data for control group (по = 20): 
T7 2 (1000): 
(ii) C,- УХ 1000 УХ = 50,300 55, = 50,300 - —0 
= 300 
5. (1000) — (3384)/8}? 
(іі) SScontrol vs. all others = ETE = 527 
EEE к лал = 
Source of variation | 55 df | MS | Е 
5 = ғ - = ДЕ р = ESAE 2 
Control vs. all others | 527 1 | 527 34.44 
A (age) | 3416 | 3 1139 
(iv) B (dosage) | 336 | 1 336 | 
AB | 12| 3 | 413 3.09 
Within cell | 1092 + 300 = 1392 | 91 15.30 


The comparison between the control group and all other groups com- 
bined has the general form (assuming each mean is based upon л obser- 
vations) е“ 

n[pqCo — EZAB; 

SSeontrot vs. att others = pag рр 

pe [(n/no)Co = (1/pqyG} ч 
n[1 + (1/р4)] 
The numerical value of this expression is obtained in part ш. A summary 
of the analysis of variance is given in part iv. The within-cell variation is 
the pooled variation for the eight experimental groups and the control 
group. The degrees of freedom for the within-cell variation are 

ра(п — 1) + (m — 1) = 89) + 19 = 91. 
Р The test on the interaction indicates that this source of variation is 
significantly greater than zero, F'95(3,91) = 2.71. The profiles for factor A 
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at levels b, and b; as well as the control, bọ, are plotted in Fig. 6.7. (Data 
for the control group are in units which are comparable with those in the 
experimental groups.) Inspection of these profiles indicates relatively 
large differences between the control group and the groups which were 
irradiated at ages a, and a». There is relatively little difference between the 


Criterion measure 


а, а; аз a4 
Age at irradiation 
Figure 6.7 Profiles for different dosages. 


control group and the groups given irradiation at ages a, and a, Further, 
for groups given irradiation at ages a, and аз, the groups given the larger 
dosage (№2) showed the greater decrement іп performance. A formal test 
on this latter statement uses the statistic 
[um ПАВ, + АВ) — (АВ + ABs») |? 

4nMS. сеп 
— [0380 + 405) — (310 + 340) 

40(15.30) 

The critical value for this test is Е (1,91) = 3.95. 
To test the hypothesis that groups given irradiation at ages a, and a, do 


not differ from the control group, the following comparison may be used: 


Ен — (As + лә] 


= 29.78. 


pz Nong 
6ngMSy. сеп 
2(1000) 2 
— (9 
[ 20/20 | 35 + 994) 


L 120(15.30) Э 
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For a test at the .05 level of significance, the critical value for this statistic 
is Ғ.(1,01) = 3.95. Thus the data indicate there is no statistically signif- 
icant difference between the groups irradiated at ages а; and a, and the 
control group with respect to the criterion of performance measured. 

The alternative analysis, which considers the control group as a part of 
the factorial experiment, is summarized in Table 6.7-2. The entries in 


Table 6.7-2 Numerical Example 


L—————á $a артсын ы 
АВ summary table: 


bo by bs Total 
— = (1) = 241,562 
a, | 500 380 310 1190 (2) = 
бі) аз 500 405 340 1245 (3) - 243,840 
аз 500 485 470 1455 (4) = 243,479 
a, | 500 504 490 1494 (5) = 247,037 


[2000 1774 1610 5384 


Source of variation | $$ df | MS | F 
A (age) | 2278 з |79 | 
(ii) B (dosage) 1917 2 | 986 
AB 1280 бз 213 13.46 
Within cell | 1392 88 15.82 


———— SS л o ЕЕ 
column 5, of part i have the general form пСу/т, where Cy is the sum of the 
N observations in the control group. Computational symbols (1), (3), (4), 
and (5) are obtained from the data in parti. The within-cell variation in 
part ii is obtained from Table 6.7-1. 

In comparing the two alternative analyses, it will be noted that the sum of 
squares for the main effect of factor A is larger in the first analysis than it is 
in the second analysis (3416 versus 2278). The second type of analysis will 
always lower this source of variation. Conversely, the alternative analysis 
will always tend to increase the magnitude of the variation due to interaction. 
For the numerical data being considered, both analyses lead to essentially 
the same conclusions. 


6.8 Test for Nonadditivity 


The assumptions underlying this test were discussed іп Sec. 5.20. It 
Should be remembered that only one of several possible sources of non- 
additivity is checked in this test —namely, that source which is related to the 
linear by linear component ofaninteraction. The numerical details of this 
test will be considered in this section. ^ 

In some cases which arise in practice, a factorial experiment may have 
only one observation in each cell. If the strictly additive model (no inter- 
action effects) is appropriate, then what are computationally equivalent to 
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interaction effects may be used as estimates of the experimental error. 
Computational procedures will be illustrated for the case of a 3 x 4 fac- 
torial experiment having one observation per cell. Suppose that the basic 
data for this illustration are those given at the left of part iin Table 6.8-1. 


Table 6.8-1 Numerical Example 


A; с = A; — г Хс(АВ;) = d; 
7 


a, 8 12 16 20 56] 14 3 104 
Gel) 222 214518 36 9 -2 168 
бі) a 5 4 9 22 40) 10 -1 JE NN 
Sum| 15 18 39 60 132 | 11 =@ 5 — Sed; = -190 
LS зала os MEET i 
Braes 612 20 De? = 14 
c-B,—-6|-6 -5 2 9 Xd = 146 
NET ee ы т иж ,„ 
(1) = 1452 (2) = 1998 (3) = 1508 (4) = 1890 
T Усад} (—190)2 
(ii) 55 = Qd = = 
ші — (Gu) (бойбу 190 
Source of variation 55 df MS F 
A 56.00 2 
X B 438.00 3 
(iii) AB 52.00 6 
Nonadd 17.66 1 17.66 2.57 
Balance 34.34 5 6.87 


5 EE Ea E а. 


In the column headed 4 ; the mean of the observations in the corre- 


sponding row is entered. In the column headed с an entry has the form 
A,— б. For example, 


с-14-11- 3, 
= 9—11— —2, 
The entries in the row headed с; have the form В, — б. 


An entry in the column headed d, is a weighted sum of the entries in the 


corresponding row, the weights being respective entries in row с. For 
example, 


d, = (—6)(8) + (—5)(12) + (2)(16) + (9)(20) = 104, 

d, = (—6)(2) + (—5)(2) + (2)(14) + (9)(18) = 168, 

ds = (—6)(5) + (—5)(4) + (2X9) + (9)(22) = 166. 
The numerical value of the comparison for nonadditivity is 


Хос) = Уса, 


= (3)(104) + (—2)(168) + (—1)(166) = —190, 
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The sum of squares corresponding to this comparison is computed in part 
ii. The numerical values for the computational symbols defined in Table 
6.2-1 are also given in part ii. 

As a partial check on the computation of SSnonaaa» the following relation- 
ships must hold: 

pa(Eci(Zc5) = (55,)(55.), 
8(14)(146) — (56)(438), 
24,528 — 24,528. 


The balance, or residual variation, is given by 
SSpai = SS, — SSnonaaa- 


Since 55, лаа is a component having a single degree of freedom, SSpu has 
(p — 14 —1) — 1 degrees of freedom. The F ratio in the test for non- 
additivity has the form 


p — MSnonaaa _ 17.66 _ 4 57, 
MSpat 6.87 


If the decision rule leads to accepting the hypothesis of additivity, then 
the strictly additive model is used in subsequent analyses. On the other 
hand, if this hypothesis is rejected, the more complete model is used in sub- 
sequent analyses. The latter model is generally the more conservative in 
the sense that higher F ratios are required for significance at a specified level. 
If the level of significance is set at a numerically high value (say, « = .25 
rather than x = .05 or .01), the type 2 error becomes relatively low. In this 
case low type 2 error implies low probability of using the additive model 
When in fact it is inappropriate. E Ё 

If this test is made at the .25 level of significance, the critical value is 
Е,(1,5) = 1.69. Since the observed value of ће F statistic exceeds this 
critical value, the hypothesis of a strictly additive model is rejected at the 
.25 level of significance. If, however, MS,, is used as the denominator in 
testing main effects, the resulting test will be biased in the direction of giving 
too few “significant” results. ; Д 

There is an alternative method for computing SS,on.aq Which lends itself 
more readily to direct generalization to higher-order interaction effects. 
As a first step in the computation of the comparison desired, one sets up a 
table in which the entry in cell ij is c;c;. For the data in Table 6.8-1 the 
resulting table is as follows: 


b, by bs by Total 
a =18 515 6 27 0 
а 12 10 ЖЕЙ. ЕС 0 
аз 6 5 =2 =2 0 


Total 0 0 0 0 
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The entry in cell аб, = (3)(—6) = —18; the entry ар, = (3)(—5) = —15. 
As a check on the numerical work, each row total must be zero; each column 
total must also be zero. 

The comparison in the test for nonadditivity is a weighted sum of the data 
in the upper left-hand portion of part i of Table 6.8-1, the weight being the 
corresponding cell entry in the table given above. Thus, 


EXc,c;X;, = (—18)(8) + (—15)(12) + - + (—2)(9) + (—9)(22) 
= —190. 
As а check on the numerical work, 
XX(ccjf = (2) с). 
The sum on the left is given by 
(—18)* + (—15)? + +++ + (—2)* + (—9)* = 2044. 
The term on the right is given by 
(14)(146) = 2044. 

This latter computational scheme is readily extended to the case of a 
p X q X r factorial experiment. Тһе data in Table 6.8-2 will be used to 
indicate the numerical details. These data representa 3 x 3 x 3 factorial 
experiment in which there is one observation per cell. The usual summary 
tables for a three-factor factorial experiment are given in part ii. Data for 
the latter are given in part i. Numerical values of the computational sym- 
bols defined in Table 6.5-1 are given in part ii. Since there is only one 
observation per cell, symbols (2) and (9) are identical, i.e., 

EX, = (АВС). 
(The letter c is used for two different concepts, but the context should make 


clear what is meant. In one context сі, Сә, and сз represent the levels of 
factor C. Ina second context c;, сз, and c, represent deviations from the 
grand mean.) 

Means and the deviations of the means from the grand mean are com- 
puted in partiv. In this context c, — C, — G. Similarly, с, = A, — G. 
The entry in cell abc; in part v has the general form C;CjCy Thus the entry 
in cell abe,,, is (—3.4)(—1.8)(—5.1) 31.2. The entry in cell abc,» is 
(—3.4)(1.2)(3.2) = —13.1. Asa check on the computational work, 


Leese = Dees, = Dece, = 0. 
j D 


That is, the sum of any column in part v must be zero; also the sum over 5; 
within any fixed level of ac;, must be zero. For example, for level ас 


—31.2 + 20.8 + 10.4 = 0.0. 
For level асу», 


11.6 + (—7.7) + (—3.9) = 0.0. 
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Table 6.8-2 Numerical Example 
_——-——-—--—— 


сі C2 єз 
Total 
b bg bg БОЛТЫН [23 iC ER ORE 
ay, 3 6 9 6 J 12 9 12 15 81 
а 6 9 12 12 18 21 15 21 21 135 
аҙ 9 9 3 18 21 12 18 18 12 120 
18 24 24 36 48 45 42 51 48 336 


AB summary table AC summary table 
b by b, Total сі Co c, Total 
2a; |) 18: 5 2700886 81 д | ЕЕЕ 2721 36 81 
(i) cag} 93. 48. 54 185 22827 51.587 2,135 
а| 45 48 27 120 alee АМЕ Гав 2120 
96 123 117 336 66 129 141 336 
BC summary table 
сі Co са Total 
hy | 182 367722 96 
b,| 24 48 51 123 
bs. |: 24 7 451 (482 ІТ 
66 09 141 336 
(1) = 4181.33 (4) = 4226.00 (7) = 4758.00 
(iii) (2) — 4986.00 (5) — 4542.00 (8) = 4590.00 
(3) = 4354.00 (6) = 4572.00 (9) = 4986.00 
б =12:4-|- 4; B; e сұ су cy | Bch = 18.62 
; 1 9 10.7 73 -34 -L8 -51 | Bey = 5.04 
(iv) 2 15 137 143 2.5 TOR 
3 133 nmi 15.7 0.9 0.6 32 | Ec} = 39.86 
€. с; 
2 | wie |. ЗИТ 389. LES goal 
| bi 2 2 bı bz bs ы % bs 
a|-312 208 104| 16-12-39) 196 131 65, 00 
(0 а | 230-153 —7.7| -<86 5.7 29|-144 96 48 0.0 
а | 82—55 -27| -30 20 10|-— у, EA E 200 
0.0 00 00| 00 00 00| 00 00 00 
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The comparison associated with nonadditivity is a weighted sum of the 
entries in the cell of part i, the weights being the corresponding entries in 
party. Thus the comparison used in the test for nonadditivity is 


(—31.2)(3) + (20.8)(6) + (10.4)(9) + +++ + (1.7)(12) = 60.60. 
Тһе sum of squares for this comparison has the form 
(60.60)* 
Хауа” 


SSnonaaa = 
The divisor is given by 
Elce c)! = (—31.2)* + (20.8)! + «++ + (1.7)® = 3742. 
Within rounding error, the following relation must hold: 
Жаса = (EXE). 
The right-hand side of this last expression is 
(18.62)(5.04)(39.86) = 3740.7. 
The analysis of variance is summarized in Table 6.8-3. 


Table 6.8-3 Summary of Analysis of Variance 


Source of variation 55 df MS F 
A 172.67 2 
B 44.67 2 
с 360.67 2 
AB 173,33 4 
AC 4133 4 
BC 3.33 4 
ABC " 6.67 8 
Nonadd 0,98 1 0.98 1.21 
Balance 5,69 7 0.81 


The test for nonadditivity is given by 
0.98 
F = — = 
081 1.21. 


The critical value of this statistic for a .25-level test is F,,(1,7) = 1.57. 
Since the observed value of the F statistic does not exceed thir toa value, 
there is no reason to reject the hypothesis of additivity, The evidence from 
this test supports the hypothesis that the components of the three-factor 
interaction are һо . 1 the assumptions underlying the test are 
met, the component for nonadditivity would tend to be large relative to the 
other components, provided that the three-factor interaction estimated a 
source of variation different from experimental error. In this case, the 
three-factor interaction may be considered as an estimate of experimental 
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error (granting the validity of the assumptions). Hence the three-factor 
interaction term may be dropped from the model. In the latter case, М5,,, 
provides an estimate of o7. 


6.9 Computation of Trend Components 


Computational procedure for trends for the case of a single-factor experi- 
ment were discussed in Sec. 3.7. These procedures generalize to factorial 
experiments. Principles underlying this generalization were discussed in 
Sec. 5.18; the actual computation of trend components in a factorial experi- 
ment are considered in this section. Computational procedures will be 
illustrated for the case of a 3 x 4 factorial experiment having five observa- 
tions in each cell. 

It will be assumed that (1) both factors are fixed, (2) the levels of both 
factors represent steps along an underlying quantitative scale, and (3) the 
respective levels represent equally spaced steps along the respective scales, 
The latter assumption, which is not essential to the development, permits a 
simplification of the numerical work, since coefficients of orthogonal poly- 
nomials may be used to obtain desired sums of squares. For this case, the 
coefficients for the levels of factor A are as follows (see Table B.10): 


Linear: c; 


-2 


-1 | 1| 1 
Quadratic: с) 1 


The coefficients for the levels of factor В аге as follows: 


Linear: су -3 | -1 1 
Quadratic: c7 1 -1 -1 
Cubic: сў -l 31-3 1 


The data given in part i of Table 6.9-1 will be used as а numerical example. 
[Assume that each entry in the AB summary table is the sum of five observa- 
tions; data for the computation of symbol (2) are not given.] The analysis 
of variance is summarized in part iii, This particular analysis does not 
necessarily give the experimenter all the information he secks, There are 
many other ways in which the over-all variation may be analyzed. 

In spite of the significant interaction, it may be of interest to study the 
trend components of the main effects. For illustrative purposes, the trend 
components of the B main effect will be obtained. Тһе comparison associ- 
ated with the linear component is a weighted sum of B, totals, the weights 
being the linear coefficients. For data in part i of Table 6.9-1, the linear 
comparison is 


Cun = (—3)(19) + (—1)(21) + (1028) + (3Х37) = 61. 
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Table 6.9-1 Numerical Example 


AB summary table (n = 5): 


a 3 5 9 14 31 
(i) аз 7 11 1577 20 53 
аз 9 5 4 3 21 
(97-521 28 57 105 

(1) (1) - 183.75 (3) - 210.55 (5) - 247.40 

(2) - 280.00 (4) - 197.00 
| 
Source of variation SS df MS jay 

A 26.80 2 13.40 19.71 

(iii) B 13.25 3 4.42 6.50 

AB 23.60 6 3.93 5.78 
Within cell 32.40 48 0.68 


The linear component of the variation due to the main effects of factor B is 


Cia Ор 
npè(c;)? 5(3)(20) 
Computational formulas for the quadratic and cubic components of the B 
main effect are summarized in Table 6.9-2. Ші noted that the linear com- 
ponent accounts for 12.40/13.25, or 94 per cent, of the variation due to the 
main effect. This means that on the average over levels of factor A the 
criterion measure predominantly is a linear function of levels of factor A. 


58) = 


Table 6.9-2 Trends of B Main Effects 


6 d -1 1 3 Ха» = 20 
(i) су 1 -1 =I 1 Хо»- 4 
су -1 3 -3 1 Ж(с/')# = 20 


_ (В) _ (61)? 
SS = nPE = 303020) = 12.40 


ii 2c B) (7) 
(ii) Sm - CGB) _ ЧЛ а 
(апаа) me? Syd) 0.82 
SB) (372 
55 сшыс) – (298) СЭ) = 0.03 


mpc} 5(3)00) 
55, 132 
-UEN arr ы. 


л 
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A test on whether a trend component differs significantly from zero uses the 


statistic 
pee MStrena 


MSy. сеп 3 
For example, a test on the quadratic trend is given by 


Fa Мӛмше, 082 2454 
MSy.cett 0.68 


The critical value for a .05-level test is F,,(1,48) = 4.04. Hence the 
experimental data indicate that the hypothesis of no quadratic trend in 
the main effect of factor B is tenable. 


Table 6.9-3 Trends within AB Interaction 


Linear x Linear 


bi bs bs bs SS; = Ed, (ABE 
= ————— linxlin = n(Edi;) 


() &| ОТОО ОС аш _ 15.68 
а| -3 -і 1 3 5(40) 


Quadratic x Linear 


| b, b, bs b, _ [Ed (АВР 
= - SSquad xlin = er 
^ 4| 35 rl RN, 5s 
(ii) а 6 2 -2 -6 2-60. 726 
a | —3 1 -1 3 5(120) 


Linear x Quadratic 


| b b b b _ [94 (AB 
aay DD SSin x quad = ET o7 


а|-і 1 1 -1 
di) ^ e| 0 02 (0 0 20280 
аз 1 -1 =1 1 5(8) 


A significant interaction implies that the response surface for different 
levels of factor B (or A) is not homogeneous, i.e., that profiles are not 
parallel. The linear x linear, linear x quadratic, etc., components of 
interaction indicate the fit of variously shaped surfaces, ie., different 
patterns of profiles. Computational procedures are summarized in Table 


69-3. The weights for the linear x linear comparison are given in parti. 
An entry in cell ab,; of this table has the form 


d; = cfc}. 
dy =(—-1(-3) =3, dg—(-DC-D—L 
dy, = (0)(—3) = 0, da = (0)(—3) = 0. 


For example, 
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The weights for the quadratic x linear comparison are given in part ii. 
An entry in this latter table is given by 

dy = іс, 
(Although the symbol dy is used for the typical entry in different tables, the 
context will make it clear which table is meant. A more explicit, but more 
cumbersome, notation for the latter dy would be d.) The entries in 
part ii are obtained as follows: 


dy = (1X3) = 3, da= (1(—1) = —1, 
dy (203) = —6, 4,-(-2Х-1)-2, 
dy = (1X3) = 3, da= (1Х—1). = 1. 


tional formulas for some of the trend components of the AB 
interaction are summarized in Table 6.9-3. With suitable definition of din 
the other components have the same general form. In each case d,, refers 
to an entry in a different table of weights. Ofthe total variation due to AB, 
the linear x linear component accounts for 15,68/23.60, or 66 per cent. 
The sum of the linear x linear component and the quadratic x lincar 


components accounts for 


15.68 -- 7.26 
in." 
of 97 per cent. Tests on trend components of the interaction have the 
following. general form: MS 


0 Sean ont mt ~ UR ~ SET, - 95 — 2 - 4j 


Sect = ЖШ ~ SEE = 11 — 03 - 0 
58, = 2360 
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It is sometimes of interest to study differences in trends for the sim 
effects of one factor at different levels of a second factor. For illustrative 
purposes, differences between trends for the simple effects of factor B at 
different levels of factor A will be obtained, (The degrees of freedom for 
such differences in trend are p — 1 = 2 for each trend.) Computa- 
tional formulas for these sources of variation are given in Table 6.9-4. 


Figure 6.8 Profiles of factor B at levels of factor A. 


In this table the symbol d; is defined as follows: 
d; = SeKAB,,). 
For example, d 
dj = (303) C7) + 00) + (314) = 37. 
The symbols d; and d” are defined as follows: 
4 = хив). 
d; = > (AB,,). 


The variation due to differences in linear trends in simple effects of 
factor B explains 23.39/23.60, or 99 per cent, of the total variation of the 
AB interaction. This means that 99 per cent of the AB interaction arises 
from differences between the linear trends in the profiles of factor В at the 
different levels of factor 4. These profiles are shown in Fig. 6.8. If these 
profiles were plotted in a three-dimensional space, the 
represented by the AB summary table would be obtained. This response 
Surface is shown in Fig. 6.9. From this surface it is seen that profiles for 
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factor A at fixed levels of factor B are predominantly quadratic in form. 
However, the profiles for factor B at fixed levels of factor A tend to be linear 
inform. Hence the shape of this surface is predominantly quadratic x 
linear. 


Figure 6.9 Response surface. 


6.10 General Computational Formulas for Main Effects and 
Interactions 


The following notation will be used in this section. 


А, = sum of all observations at level a,. 
па = number of observations summed to obtain A,; n, is assumed constant 
for all levels of factor A. 
АВ, = sum of.all observations at level ab,;. 


Mas = number of observations summed to Obtain AB,;; п,, is assumed 
constant for all ab, s. 


АВС,» = sum of all Observations at level абс. 


Mawe = number of observations summed to obtain АВС; ry, is assumed 
constant for all abc, "s. 


In terms of this notation, a general formula for the main effect due to factor 
A is 
2 2 
,-54 E 
n, n, 
where G is the grand total of all observations and 7, is the number of obser- 
vations summed to obtain G. (In this context an Observation is a measure- 


ment on the experimental unit.) A general formula for the main effect due 
to factor B has the form 
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The general computational formula for the variation due to the AB inter- 
action is 
Dy 2 2 
56. лыы abes spes 
Nav п, 
The general computational formula for the variation due to the AC inter- 
action has the form 
2 2 
ss, = С — €. — (ss, + sS). 
n 
(Unless otherwise indicated, the range of summation is over all possible 
values of the terms being summed.) 
The general computational formula for variation due to the ABC inter- 
action has the form 
2 2 
SS, = АВС _ © _ (ss, + $$, + SS, 4-55, + SS, + 88,2. 
Nave Ng 
The computational formula for the variation due to the UVW interaction is 


2 2 
ss, ОРИ? _ 0 _ (ss, + SS, + SS, + SSw + SSuv + 58м). 


uvw 
n ите п, 


Variation due to a four-factor interaction has the general formula 


2 2 
SS, — СУХУ 6 _ (ss, +88, + SS, + SS) 


ШТ Ny 


— (SS, + $$ + SSuz + 58, + 55, + 85) 
— (SSwww + SSuve + 58» + 89). 
In a factorial experiment having / factors there are 


( 5 two-factor interactions, 


( | three-factor interactions, 


( Г ) m-factor interactions. 
m 


For example, in a four-factor experiment, the number plrpossible жо» 
factor interactions is 
(%) BPO EN 


2/ 10) 
In à four-factor experiment the number of possible three-factor interactions 


18 
(9 BLOOR 
3/. 10/3) 
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Alternatively, given a /-factor interaction, the formulas that have been given 
above indicate the number of different two-factor, three-factor, etc., inter- 
actions that may be formed from the z letters in the ¢-factor interaction, 
The general formula for variation due to a five-factor interaction is 
2 2 
SS us, = ШИИ _ @# O | (ду (ip + (дуу 
where (i) = ESS nam emcr—there are five terms in this sum; 
(ii) = Х55,, пеш int—there are [5(4)]/[1(2)] = 10 terms in this sum; 
(ii) = б ае еге are [5(4)(3)]/[12)(3)] = 10 terms in this 
sum; 
(iv) = ASS our fastor еге are (5(4)(3)(2)]/[1(2)(3)(4)] = 5 terms in 
this sum. 
The general formulas given above assume that no factor is nested under 
any other. In case factor C is nested under factor B, the general formula 
for the main effect due to C within Bis 


y 2 2 
55, = ЖАСУ td A 
Noe n, 
ст 55, іг 58,» 
where SS, is the variation due to the main effect of factor Cif the nesting is 
disregarded and SS,, has an analogous definition, The general formula for 
the A x (C within B) variation is 
SS, wh) = АВС = ЖАву E = всу ate EE 
Nave Nay Nye п, 


= SS,. + SSares 


Where SS, is the three-factor variati i ing dis- 
тейм i aie ton computed with the nesting dis 


SS, io э.) = 55, + SS, 


where the terms on the right are computed as if there were no nesting. 


For the case in which factor C is nested und i 
гий ам ЗОО under factor В and factor D is 


5840, bande) = X(Bc py . Хвсу 
Lo ль. 
= SS, + SS, + SS, + SS... 
The terms in this last line are computed as if there were no nesting. Also, 
SSaaw. bande) = XaBcpy s3 Хавсу . ХВвсру T X(BCy 
Pesca Mave тм Nye 
ow 55,, T SS, M 55, + 55... 
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The degrees of freedom for the variation on the left are the sum of the 
degrees of freedom of the variations on the right. The degrees of freedom 
may be checked as follows: 


df = pqrs — pqr — qrs +9" 
= pars — 1) —ar(s — 1) 
= 45 = Vp = 1). 
The terms in the first line of the above expression аге obtained as follows. 


pars = number of observations in an ABCD, 
pqr = number of observations in ап ABC, 
qrs — number of observations in a BC D, 
pq = number of observations in a BC. 


6.11 Missing Data 


For a factorial experiment in which the cell frequencies are not equal but 
all n,/s are approximately equal, the analysis by the method of unweighted 
means presents no particular problems, provided that all cells contain at 
least one observation. Intheunweighted-means analysis, the mean of each 
cell is estimated by the observations actually made within that cell. Іп 
cases where the number of observations within а particular cell is small 
relative to the number of observations in other cells in the same row and 
column or adjacent rows and adjacent columns, then information provided 
by these other cells may be used in estimating the mean ofa specified cell. 
In an unweighted-means analysis, such information is not utilized in the 
estimation process. | 

Іп ап experiment having no observed data in given cells, estimates of such 
cell means may have to be obtained from other cells in the experiment, | If 
the form of the response surface were known, estimates of the missing 
entries could be obtained by using а multiple regression equation, | This 
solution is generally not practical. One method, having somewhat limited 
utility, is to estimate the missing mean in cell jj by the following formula, 

Ав, = Aj B; — G^ 
where A’ is the mean of the observations actually made under level а; 
similar definitions hold for В, and G'. This method of estimating missing 
data assumes no interaction present. Because this method of estimation 
does not take into account trends in the row and column of the missing 
entry, its utility is limited. For the case of a three-factor experiment, the 
method that has just been described takes the form 
АВС, = Ai + B, C — 26’. 

The presence of interaction effects is а stumbling block in all methods for 
estimating missing cell entries. A review of methods that have been pro- 
posed and an extensive bibliography on the topic of missing data are given 
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in Federer (1955, pp. 124-127, 133-134). Caution and judgment are called 
for in the use of any method for estimating missing data. Each of the pro- 
posed methods has a set of assumptions that must be considered with care. 

Consider the numerical data given in Table 6.11-1. These data represent 
the available observations їп а 5 x 4 factorial experiment. Assume the 
levels of the factors represent steps along a quantitative scale. There is a 
single measurement in 18 of the 20 cells. Entries in cells ab», and аб, are 
missing. The unknown values of these cell entries are designated u» and 
им. [The data in this table appear in Bennett and Franklin (1954, p. 
383)] A method of estimating missing observations, which minimizes the 


Table 6.11-1 Numerical Example 
by b, bs b, Total 


a, 28 20: At 10 69 

а иу 16 15 8 39 + toy 

аҙ 29 13 16 изд 58 + из, 
66 


аз |12817 15 10 64 
112 703075 39 296 us F uy 


Cáceres АЕО RU a i M a 


interaction effect in the analysis resulting when such estimates are used in 
place of the missing data, is described by Bennett and Franklin (1954, pp. 
382-383). Using this method, Bennett and Franklin arrive at the following 


estimates: 
из = 28.6, ug, = 10.2, 

If these two estimates of the missing entries are inserted in their proper 
places and the usual analysis of variance computed as if all data were 
observed, the variation due to AB will be a minimum. In essence, this 
principle for estimating the missing entries keeps the profiles of the simple 
effects as parallel as possible. 

A simplified version of this latter principle utilizes information from only 
those cells adjacent to the cell in which data are missing. For example, 


us (D + 48 


Н 16 D NR 3 
Solving for u», gives 
Us, = 29.0, 


This method of estimation assumes that the slope of the profiles for levels 
b, and b, at level а; is the mean of corresponding slopes at levels a, and 
аз. Thelatter two slopes can be obtained from theobserved data. Analo- 


gously, 
Us _ (15) + (4%) 
2 


6 
from which из = 92. 
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To use the first method discussed in this section for estimating missing 
entries, one proceeds as follows: 
ADEM ИК198: 
112980, B/=9.8; 
G' = BS = 164. 

In each case the denominator is the number of observations in the corre- 
sponding total. The estimates of the unknown entries are 

ug, = 13.0 + 28.0 — 16.4 = 24.6, 

ug = 19.3 + 9.8 — 164 = 12.6. 
If the analysis of variance were to be carried out with estimated values 
substituted for the missing values, the degrees of freedom for the resulting 
two-factor interaction would be 

df,, = (p — 1)(q — 1) — (number of missing values). 


i 
\ 


For the numerical example їп Table 6.11-1, 
df,, = 43) — 2 = 10. 

By way of summary, there are mathematically elegant methods for esti- 
mating missing cell entries. None of these methods is satisfactory unless 
the experimenter has information about the nature of the response surface 
being studied. There is, however, no real substitute for experimental data. 
In the example given in Table 6.11-1, the experimentally determined values 
of the missing entries were 

изу = 23, Ung = 24. 
None of the methods considered yielded values relatively close to the 
observed value for и. 


6.12 Special Computational Procedures When All Factors Have 
Two Levels 
When each factor in a factorial experiment has two levels, the computa- 


tional formulas for main effects and interactions may be simplified. For 
the case of a 2 x 2 factorial experiment having n observations per cell, 


ss, (A A 
4n 
ss, = Gi 29, 
4n 
ss, — [4Bu + АВы) — (AB + ABS 


4n 

In the expression for the interaction, note that the sum of the subscripts of 
each term in the first set of parentheses is an even number, whereas thesum 
of the subscripts for each term in the second set of parentheses is an odd 
number. 
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For the case of a2 x 2 x 2 factorial experiment having observations 
per cell, 


S A 
SS, m 
(В, — В) 
SS, = > 
5, 8п 
ss. — (a= a)? у 
Я 8п 
SS ПАВ, + АВ.) — (АВ, + AB»)? 
Th 8n 
SS [(4Cu + АС») — (АС, + ACS)P 
ej 8n 
Б (ВС, + BCs) — (ВС, + BCs)? 
be > 
8n 


[(АВС + АВС, + АВС, + АВС») (АВС + АВС, ч 
+ АВС,ц + АВС,»)| 4 


SS 


abc 


8n 


Again note that in expressions for interactions the sum of the subscripts for 
each term within a pair of parentheses is an odd number in one case andan 
even number in the other case. 

The computations for a three-factor experiment may be made without 
obtaining two-factor Summary tables by following the scheme given in 
Table 6.12-1. 

The ABC summary table is given in part i. The symbol si in part i’ 
Tepresents the sum of the entries in the first column of part i; 5), s4, and s; 
Tepresent sums of entries in the respective columns of parti. These sums 
should be arranged as indicated in parti’. The symbol d/ represents the 
difference between the two entries in the first column in part i, that is, 
4 = ABO АВС. The symbols 4%, d;, and d; represent corre- 
sponding differences between entries in corresponding column in part i. 
For example, d; — АВС,» — АВС». 

The entries in part i” are obtained from the entries in part i’ in the same 
general manner as corresponding entries were obtained from parti. That 
is, 

=, + s, 53 = di + dj, 
Sus peo MUST 
Also, (8 ааа d;, 


ТЖ NEUE " 
dy = 55 — 5, dj = d; — di. 
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The entries in part i” are obtained from the entries in part i” by means of the 
same general pattern. 


m 


=i ts, = di + dy 
4-%%% Pa B+ di, 
p 4-4-% dy = di а 


а" =sy—sy dy = d; — dy. 


Table 6.12-1 Computational Scheme for 2 x 2 x 2 
Factorial Experiment 


| сі | C2 
a | ааг. 
1 by — | bg bs 
(i) j = рт. 2-4 
а ABCu АВС ABC АВС,» 
a, | АВС, АВС | ABC: АВС 
(1) (2) (3) (4) 
с ————— 
] ‚ ] j 
ы Sy 53 di а 
E Hm qu ME: 
SS Ени 
Й 51 54 di d; 
y b MUN: 
T E] st а" а" 
e) Keine ine 
SS, = (s; [8n SS, = (dj )°/8п 
(ii) SS, = (s3/)°/8n 55, = (2: )°/8п 
SS, = (5{ )°/8п SS,, = (dy')?/8n 


55. = (4, [8n 


Pe SS SS oo 


Computational formulas for the sums of squares are giveninpartii. These 
computational formulas are identical to those given earlier in this section. 
A numerical example of this computational scheme is given in Table 6.12-2. 
As a check on the computational work, 


ХАВСУ G 
п 8п 


SSpetween cells = 


= X(main effects) + X(two-factor int) 


| X(three-factor int). 
For the data in Table 6.12-2, 


SShetween'celis = 625.00 — 525.62 = 99.38. 
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Table 6.12-2 Numerical Example 


сі % 
(i) by by b, ba- (-5 
a 10 15 20 10 
az |1720 30 30 10 
(1) (2) (3) (4) 
@) 30 502 ы (0) 
45 20 —15 0 
а oer mter MERE 
à") 15, "22 "ШУ 5 
70 —10 30 —10 
же еее бармын” е” ащ _ 
à") 145 15 5 —45 
-35 -5 -15 15 
SS, = (—35)?/40 = 30.62 SS, = (5)?/40 = .62 
(i) SS, = (15)/40 = 5.62 SSac = (—15)?/40 = 5.62 
55 = (-5)/40 = .62 55, = (—45)?/40 = 50.62 


SSine = (15)°/40 = 5.62 
=т= a ee 

Within rounding error, this is the numerical value of the sum of the varia- 
tions computed in part ii. 

_ The computational scheme that has just been illustrated may be general- 
ized to апу 2" factorial experiment, where К is the number of factors. The 
generalization to a 24 = 2 x 2 x 2 x 2 factorial experiment is illustrated 
in Table 6.12-3. In part i the ABCD summary table is represented sche- 
matically. In part i’ each s’ represents the sum of the entries in the corre- 
sponding columns of part i; each d’ represents the difference between the 
elements in the corresponding columns of part i. Entries in part i” are 


ed from the entries in the columns of part i’ in an analogous manner. 
at is, 


МЕ " 
=s +9,  di—s—s; 


B= d rd, dg = d; — dy, 
This procedure continues until part i* is completed, in this case until part 


bos 2 
i" is completed. Computational formulas for the variation due to the 


sources indicated in part ii may be obtained from the corresponding entry in 
parti". For example, 
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Table 6.12-3 Computational Scheme for 2t Factorial Experiment 


а, | а, 
СА t3 [i eg 
by bz ы by b by ы bz 
(i) 
a 
а 
(1) (2) (3) (4) (5) (6) (7) (8) 
МИЗ: оов eat OS л vn Rae ee 
(0) ie % % 5; di dy & 
A 54 56 5% а; d; а di 
à") СА sy sp 57 di ау di Ят 
52 54 56 54 а; di dg dg 
ж s s s? st а" а" а" а" 
sit sit st st а" а" 1" а" 
d» giv sy iv сМ div div d ІН diy 
> sly siy sy OM ау di* ау ау 
2 — B G BC D BD CD BCD 
2 A AB AC ABC AD ABD ACD ABCD 


The general formula is (өзі iS (еу 


2% 25n 


This computational procedure is particularly useful when k is 4 or larger. 


6.13 Illustrative Applications 


In a study reported by Wulff and Stolurow (1957) the experimental plan 
was a2 x 2 factorial having 10 subjects per cell. Factor A was essentially 
a classification factor; its levels indicated the aptitude of the subjects as 
measured by a test of mechanical aptitude. The levels of factor B were the 
methods of instruction used in a paired-associates learning task. The 
criterion was the number of correct responses in a block of trials. The 


analysis of variance had the following form: 
Тее Lie oe ee 


Source df MS Ё 
A (aptitude) 1 2175.62 17.96 
B (method) 1 931.22 7.68 
AB 1 9.02 
Within cell 36 121.10 
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Factors A and B were both considered to be fixed; hence MS,, ee Was the 
denominator for all F ratios. The factor of primary interest was B; the 
critical value for a .05-level test on this factor is Е os(1,36) = 4.11. The 
experimental data indicated that there is a Statistically significant difference 
between the two methods of instruction with respect to the mean number of 
correct responses in the specified blocks of trials, 

Gordon (1959) reports a4 х 4 factorial experiment in which both factors 
A and В were considered to be random. The levels of factor A represented 
the kind of task on which subjects were given pretraining; the levels of factor 
B represented the kind of task to which the Subjects were transferred. The 
design may be represented schematically as follows: 


DAY WE LIE 
a, Gy Gy 47 Gu 
а Gy б б» бы 
а n Gs; Gay бу 


4 Gy 97 ба [27 
The symbol G,, represents the group of subjects given pretraining under 
task jand transferred to taskj. The criterion measure was the performance 
on the second task. The primary purpose of the study was to measure 
possible effects of transfer of training; the latter would be indicated by the 
magnitude of the interaction effects, 
The analysis of variance in the Gordon study had the following form: 


Source | df MS Е 
A (first task) 9077 14 
B(second task) 3 | 470 32.38 
AB 9 14 3.82 
Within cell | 144 | оз 


The analysis given above represents only part of the analysis; separate 
analyses were made for different stages in the learning of the second task. 


Since both factors A and B were considered random, the F ratios had the 
following form: 


F = MS, = M5, = _М%_ 
MS,, MS,, MSw cots” 


These are the F ratios which would be obtained if one were to derive the 
expected values of the mean squares. The critical value for a .05-level test 
on the interaction is Ё (9,144) = 1.94. (In the Opinion of the author the 
factors in this kind of experiment should be considered fixed rather than 
random; the study did not indicate the randomization procedure whereby 
the tasks used in the experiment were Obtained. Rather, the indications 
were that the tasks were deliberately chosen to meet specified objectives.) 

A 6 x 4 x 3 factorial having only one observation per cell is reported 


EE EEE a „ШИ 
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by Aborn etal. (1959). These writers were interested in studying the effects 
of contextual constraints upon recognition of words in sentences, The 
observation in each of the cells was a mean. А square-root transformation 
was used before the analysis was made. Entries in three of the cells were 
missing and were estimated from the data in the remaining cells. All the 
interactions were pooled into a single error (егт. The analysis of variance 
had the following form: 


Source df MS F 
A (class of word) 5 4.44 26.28 
B (position) 3 47 2.76 
С (sentence length) 2 .54 3.22 
Pooled interactions 3 . 


The pooled interaction term would have 61 degrees of freedom if there 
were no missing data; since 3 of the 72 cell entries were missing, degrees 
of freedom for the resulting pooled interaction are 61 — 3 = 58. The 
separate interaction terms have the following degrees of freedom: 


AB: $x3215 
AC: 5x2=10 
BC: 3x2= 6 


ABC: 5х3х2-3-27 


Bamford and Ritchie (1958) report an experiment which may be герге- 
sented as follows: 


Subject Ln 
ЫЙ ҰНЫ UNE 


ne 


9 
NER ee SS ee 

The levels of factor A represent control and experimental conditions for 
turn indicators on an airplane instrument panel. The levels of factor 
B represent successive trials under each condition. Each of the nine sub- 
jects was observed under each of the levels of factor А as well as each of the 
levels of factor B. Тһе order in which trials were made under the level of 
factor A was counterbalanced. ' | b 

This design, as well as the others which follow in this section, actually falls 
in the repeated-measures category. The usual tests on main effects and 
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interactions are valid only under a set of highly restrictive assumptions on 
covariance matrices. Checks on these assumptions have, for the most part, 
been ignored by experimenters in the behavioral sciences. The usual tests 
tend to err on the side of yielding too many significant results (positive bias) 
when homogeneity assumptions on covariance matrices аге not met. A 
test procedure which avoids this kind of bias is discussed in Sec. 7.2. In 
experiments involving a learning or practice effect, homogeneity conditions 
required for the usual F tests to be valid are generally not present. 

In form, this plan may be considered as a special case of a2 x 4 x 9 
factorial experiment. The subject factor is considered random; factors 4 
and Bare considered fixed. Assuming that all interactions with the subject 
factor may be pooled, i.e., that all interactions with the subject factor are 
independent estimates of experimental error, the analysis of variance has the 
following form: 


Source df Ms | Fr 
Subjects 8 11.83 
A 1 20.77 5.82 
B 3 11.42 3.19 
AB 3 9.14 2.56 
Pooled error 56 3.57 


2---- ЕРЕН ee itt 
Gerathewohl et al. (1957) report a study having the form of a2 x 9 x 9 

factorial experiment. The purpose of this experiment was to study the 

effects of speed and direction of rotation upon the pattern of circular eye 


Table 6.13-1 Summary of Analysis of Variance 


Source df MS ! d 

С (subjects) 8 30.80 

A (direction) 1 47.22 3.41 

АС 8 13.84 

B (speed) 8 85.66 17.8 

BC 64 4.80 

AB 8 1.75 63 
ABC 64 2.77 

Total 161 | 
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movements. Factor A represented the direction of rotation; factor B 
represented the speed of the rotation. Factor C represented a subject 
factor. The order in which Subjects were observed under the levels of 
factors A and B was the same for all Subjects. The subject factor was 
considered random, but factors 4 and B were considered fixed. The 
expected values of the mean squares for this design are analogous to those 
obtained in Table 6.5-3 when homogeneity assumptions discussed in Sec. 


UNEQUAL CELL FREQUENCIES—LEAST-SQUARES SOLUTION 291 


7.2 are met. The analysis of variance reported by these workers is sum- 
marized (with slight modification) in Table 6.13-1. 

Inspection of the interactions with the subject factor indicates that no 
pooling is possible. Hence the F ratios have the following form: 


F М9; 5 Е MS, F М5, 


MS,, MS, MS, 


Jerison (1959) reports a study which is a special case ofa 2 x 3 x 4 x 9 
factorial experiment. The purpose of this experiment was to study the 
effects of noise on human performance. The levels of factor A represented 


Table 6.13-2 Summary of Analysis of Variance 


Source df MS F 
Subjects 8 6544.90 
A (noise conditions) 1 8490.08 2.93 
A x subjects 8 2900.09 
B (periods) 3 479.67 6.32 
B x subjects 24 75.89 
C (clocks) 2 489.31 1.24 
C x subjects 16 396.08 
AB 3 600.47 3,48 
AB x subjects 24 172.60 
AC 2 280.52 1.18 
AC x subjects 16 238.36 
BC 6 138.63 1.32 
BC x subjects 48 105.09 
ABC 6 253.10 2.07 
ABC x subjects 48 122.13 

Total 215 


_—_——-—<—-+— 


control and experimental noise conditions. The levels of factor B repre- 
sented four successive periods of time under each of the conditions of factor 
A. The levels of factor C represented three clocks monitored by the sub- 
jects during the course of the experiment. The last factor represented a 
Subject factor. The criterion was an accuracy score on each of the clocks. 
All factors except the subject factor were considered to be fixed. The 
analysis of variance is summarized in Table 6.13-2. In each case the 
denominator for an F ratio is the corresponding interaction with the 
subject factor. 


6.14 Unequal Cell Frequencies—Least-squares Solution 


The data in Table 6.14-1 will be used to illustrate the computational pro- 
cedures associated with the least-squares estimates of the sums of squares. 
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The rationale underlying these procedures was discussed in Sec. 5.23. 
Definitions of the basic symbols are also given in the latter section. 

These data have been taken from Anderson and Bancroft (1952, p. 243). 
The levels of factor A represent sex of animal; levels of factor B represent 


Table 6.14-1 Numerical Example with Unequal Cell Frequencies 


Cell frequencies: 


by b, ba b, Total 
а ny = 21 ng = 15 пуз = 12 na = 7 ‚= 
(7 | niue 2T п» = 25 Nog = 23 Noy = 19 К 
пу = 48 пз = пз = 35 па = 26 = 
—————— жуыла” ЖҰРТЫ 2 
Cell totals: 
by by bs b, 


а | ABy —3716 AB, = 2422 АВ, —1868 AB, = 1197 A, = 9203 
QD а | ABy —2957 AB, —2852 АВ, —2496 AB. —2029 A, — 10334 
В, = 6613 В, = 5274 В,-4%4 — B,—326 С = 19537 


(1) = бп. — 2,561,707 
(2) = хх? = 2,738,543 
(iii) (3) = X(At/n;)) = 1,539,913 + 1,136,080 = 2,675,993 
(4) = X(B?[n )) = 2,567,463 
(5) = (АВ? п) = 2,680,848 


four Successive generations of animals. Іп this type of experiment, the cell 
frequencies are, in a real Sense, an integral part of the design. Hence a 
least-squares analysis is more appropriate than an unweighted-means 


Table 6.14-2 Direct Computation of 55,44) 


Cell means: 
bi by bs b, 


aı| 176.95 161.47 155.67 171.00 

a| 109.52 114.08 108.52 106.79 

d; 67.43 47.39 47.15 64.21 

м 11.81 9.38 7.88 5.12 Хи, 
wj;d;| 796.35 444,52 371.54 328.76 wd, 


34.19 
1941.17 


ІСІ 


analysis, (Іп part iii, with the exception of symbol (2), data from which 
the numerical values of the symbols are computed are given in parts i and ii. 
The raw data for symbol (2) are not given. 

When one of the factors has only two levels, the simplest approach (see 
Rao, 1952, pp. 95-100) is to obtain 85,5445, directly from part i of Table 
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6.14-1 and from Table 6.14-2. In the latter table, 
4,- AB, — АВ, апа  w,— us. 


niy + пы 
2 
PECO aa 

48 


For example, 


The adjusted sum of squares due to interaction is given by 


(2w,d;)° 


W; 


SSuaj = Хи — 


The first term on the right is most readily obtained from the terms w;d;, that 
is, 
Уу = а (иа) + d(wads) +--+ + dy(wydy) 


= (67.43)(796.35) + (47.39)(444.52) + >>. + (64.21)(328.76) 
= 113,391. 
1941.17): 
Thus, adj 391 - 3180. 
5ӛақаар = 113, 34.19 


Table 6.14-3 Summary of Analysis of Variance (Least Squares) 


Source of variation 55 df MS 
A Sex 110,205 1 110,205 
B Generations 1,675 3 555 
AB 3,180 3 1,060 
Error 57,695 141 409.2 


From part iii of Table 6.14-1, the unadjusted sums of squares аге 
SSceus = (5) — (1) = 119,141, 
SS, = (3) — (1) = 114,286, 
SS, = (4) — (1) = 5756. 


The adjusted sum of squares due to factor A is 


SSacaaiy = $$сепв — SSan(aaiy — SS, = 110,205. 
Similarly, ^— SS) = SSeens — Sanaa — 55, = 1675. 


From part iii of Table 6.14-1 one obtains 
SSerror = (2) — (5) = 57,695. 


A summary of the analysis of variance is given in Table 6.14-3. Tests 
follow the same pattern as that of the usual factorial experiment in which 
model I is appropriate. 


294 COMPUTATIONAL PROCEDURES AND NUMERICAL EXAMPLES 


The direct computation of SS,,,5 will be illustrated by data in Table 
6.14-4. This procedure, or a modified version thereof, is the one generally 
followed in a p x q factorial experiment when the smaller of p and q is 


Table 6.14-4 Computation of 55) 44)) 


Cell frequencies: 
b, ba bs b, Total 


a on Кее, РАН ӘРЛЕУ 
© aa ЛЫМ OS a E SA 4, = 10394 
85 AOIS 2 TA 
CESSES 75:277 «07 ай oa ctl Й 
b, by by b, "n 
b, | 322265 ~12.9081 —11.1882 -81302 19085 
aU 2926020 —9,3897 ^ —69623 1569 
by 261542 -61762 —17246 
b, 21.2687 —34.08 
EAN eR S s: NR 
bi | 32.2265 12.9081 — —111882 ^ —81302 19085 
bf | 1000 4005 —.3472 —2523 5.9219 
ama 240905  —13,8706 ^ —102184 92.1254 
ы 10000 -.5758 —4M22 3.8241 
Д 14.8830 —148828 —53.1511 
bi 1.0000 —10000  —3.5713 
SSyaaj = (190.85)(5.9219) + (92.1254)3.8241) + ( 53.1511) 3.5713) 
= 167234 


БЕ ы ан Су 


greater than 2. The entries in part i are the cell frequencies. The entries 
in part ii to the left of the B; column are the т) For example, 


nh "ы, (2) (27) 


ny =ny— = 32.2265, 

Мі Sia red ey 55 94 

п [ | "алы ы | enes) 12.9081, 
iis Ts; 55 94 

na = - [tute алам |. (ола), CDEL i tgp, 
i л». 55 94 


As a numerical check, 
Уп, = 0. 


For example, Уп» = —12.9081 + 29.2602 — 9.3897 — 6.9623 — 0. 
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The entries in column В; of part iii are the adjusted totals for the levels of 
factor B; for example, 


B; — B, Е ) 4} 6673 [= Сказ) 


n. Ny, 94 t 
= 190.85, 
BL = By Ес s ы 5274 [2 Д 
n. Ny, 55 94 
= 15.69. 


As a check on the numerical work ХВ; = 0. 

Part iii represents what is known as the abbreviated Doolittle algorithm. 
Row b; is identical to row b; in part ii. Entries іп row bj are obtained by 
dividing each entry in row Ру by the first entry in the row, which is 32.2265. 

Entries in row bj are obtained from the following relation: 

by; F by; F^ Бы» jz23, 
where b; is the entry in row 55, column Б, and bə; is the entry in row be, 
column Б. For example, 
bi, = bog — bibis = 29.2605 — (—.4005)(—12.9081), 
bj, = bas — bibis = —9.3897 — (— 4005) — 11.1882). 
Entries in row b% are obtained by dividing each of the entries in row b by 
the entry Б», which in this case is 24.0905. Д 
Entries іп row Б; аге obtained from the following relation: 
by; = by, — Барі; — 5з» J 23. 
For example, bj, = Бы — bibis — биби 
6.9623 — (—.3472)(—8.1302) — (—.5758)( 10.2184) 
= —14.8828. 
Entries in row b% are obtained by dividing each entry in row bj by bss, which 
is 14.8830. 1 
In general, an entry in row 5; is given by the relation 
bj; = bi; — bybi; — buba — 7 bib» 
and an entry in row bf is the corresponding entry in row bi divided by bi; 

When the abbreviated Doolittle algorithm is used in this manner, there 
is a simple check on the work: the sum of the entries in row bi, excluding the 
entry in column В), will be zero. If there are q levels of factor B, the last 


row in the algorithm will be 5; 1. 2 
The numerical value of $$) is obtained from the В; column of the 


abbreviated Doolittle algorithm. 
SSiaap = Lbinbin- 
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Having 55, ау» the adjusted interaction sum of squares is given by 
$5амаа) = SScens — SSa — $$маа})» 
and the adjusted sum of squares due to the main effect of factor A is 
SSacaaj) = SSa + 55қаар — 55) 
= SScens — SSab(adj) — 590. 
By way of contrast the unweighted-means analysis is given in Table 
6.14-5. In this case the two approaches lead to comparable end products. 


Table 6.14-5 Unweighted-means Analysis 


Source of variation SS df. MS 
A Sex 155,760 1 155,760 
B Generations 1,965 3 655 
AB 2,721 3 907 
Error 57,695 141 409.2 


Although the abbreviated Doolittle algorithm is rather widely used, a 
somewhat simpler computational scheme is that known as the Dwyer 
Square-root algorithm, The latter may also be used to solve the normal 
equations in the general linear-regression problem. The data in Table 
6.14-6 will be used to illustrate the Dwyer algorithm in obtaining SS;,5. 
The starting point is the data given in part ii of Table 6.14-4, 


Table 6.14-6 Computation of SSj(,4;; 


(Dwyer square-root algorithm) 
1 -- а SO ree Quei. дарсы 3 M NNNM 


b, b, by by в” 

bı | 322265 —12.9081  —11.1882 -48.1302 190.85 
by 29.2602 9.3897 —69623 15.69 
by 26.7542 -61762 -172.46 
by 21.2687  —34.08 
bi | 5.6769 | —22739  —19708 -14322 33.6187 
bi 49081  —2.8262 — —20821 18.7721 
by 3.8578 —3.8579 13.7774 


SSivaajy = (33.6187)? + (18.7721)? + (—13.7774)2 = 1672.43 
далана Ere T _, 


The entries in row Б) are 
b = Vou, by = wy SST 
For example, 


bi, = У32.2265 = 5,6769; bi, — — 12.9081 
5.6769 
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The entries in the row 5j are defined as follows: 


bia = Nba — WU ip ЗА? 
bos 


For example, b}, = 4/29.2602 — (—2.2739)? = 4.9081 
—9.3897 — (—2.2739)(— 1.9708) 


ба in 2.8262. 
The entries in row 5; are defined as follows: 
bz; = Уа — bis — bis; 
p, = Pu Ваи Вау рз 
bs 
In general, the entries in row b; are defined as follows: 
bi, = УЫ, = XA. where К = 1,2,...,i— 1; 
bi; T Хр; 
Ше тА; » леге ж 


ii 
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Entries under the B; column are considered in the same category as any 


other column, 


To obtain SS, a, the algorithm terminates after 4 y 1 rows are 
complete. The adjusted sum of squares for factor Bis obtained from the 
Bj column of the algorithm from the relationship shown in the last line of 
Table 6.14-6. As a check on the computational work, the sum of the 
entries in each b’ row (excluding the entry under column Bj) is zero (within 


rounding error). 


CHAPTER 7 


Multifactor Experiments Having 
Repeated Measures on the Same Elements 


7.1 General Purpose 


Factorial experiments in which the same experimental unit (generally a 
subject) is observed under more than one treatment condition require 
special attention. Experiments of this kind will be referred to as those in 
which there are repeated measures. Single-factor experiments of this kind 
were discussed in considerable detail in Chap. 4. 

Different approaches have been used in handling the sampling distribu- 
tions of mean squares which arise in the analysis of experiments. These 
approaches differ to some extent in the definition of the experimental unit. 
Under one approach, which assumes a multivariate normal underlying dis- 
tribution in the population, the experimental errors are considered to be 
correlated. Under an alternative approach, a nested random factor (a kind 
of dummy variable) is included in the model to absorb the correlation be- 
tween the experimental errors. This latter approach has the advantage of 
permitting a relatively simple derivation of the expected values of the mean 
squares. The disadvantage of the latter approach is that it tends to mask 
homogeneity assumptions (specifically, constant correlation between pairs 
of observations) required to Justify the sampling distribution of the final F 
ratio used in making tests. Both approaches lead to identical F ratios when 
the homogeneity assumptions are met. A comprehensive discussion of the 
homogeneity assumptions underlying repeated measure designs will be 
found in the work of Bargmann (1957). 

A two-factor experiment in which there are repeated measures on factor 
B (that is, each experimental unit is observed under all levels of factor B) 
may be represented schematically as follows: 


b b b 
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The symbol С; represents a group of n subjects. The symbol б; represents 
a second group of z subjects. The subjects in С; are observed under treat- 
ment combinations 48,4, aby), and аһ. Thus the subjects in G; areobserved 
under all levels of factor B in the experiment, but only under one level of 
factor 4. The subjects in G, are observed under treatment combinations 
aby, abs, and abs. Thus each subject іп б is observed under all levels 
of factor B in the experiment, but only under one level of factor A, 
namely, а». 

In this kind of experiment, comparisons between treatment combinations 
at different levels of factor A involve differences between groups as well as 
differences associated with factor 4. On the other hand, comparisons 
between different levels of factor B at the same level of A do not involve 
differences between groups. Since measurements included in the latter 
comparisons are based upon the same elements, main effects associated with 
such elements tend to cancel. For the latter comparisons, each element 
serves as its own control with respect to such main effects. 

In an experiment of the type represented above, the main effects of factor 
A are said to be completely confounded with differences between groups. 
On the other hand, the main effects of factor B as well as the AB interaction 
will be shown to be free of such confounding. Tests on B and AB will 
generally be considerably more sensitive than tests on the main effects of 
factor 4. Where no confounding with the group factor is present, there are 
fewer uncontrolled sources of error variance. The smaller the error vari- 
ance, the more sensitive (powerful) the test. 

By using the approach which assumes that the errors are correlated, the 
expected value of the variance due to the main effects of factor A has the 
general form 

E(MS,) = o2[1 + (4 — Dp] + ngos 


where p is the (constant) correlation between pairs of observations on the 
same element and q is thenumber of levels of factor B. The expected 
value of the mean square appropriate for thedenominator of an F ratio used 
in testing the variance due to the main effect of factor A has the form 


«ІП + (9 — Әрі. 


For sources of variation which are not confounded with the main effects 
due to differences between groups, the denominator of an F ratio has the 
expected value 

e; = р). 
Thus, if correlation between pairs of measurements is positive and con- 
stant, the latter experimental error will be smaller than the former. Hence 
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the greater sensitivity (i.e., the greater power) of tests using the latter error 
term. 

In terms of the alternative approach, which postulates the existence of a 
nested random factor, the expected value for the denominator of an F ratio 
for a test on the main effects of factor A has the form 


в + 402, 


where o? is the variation due to the main effects of subjects within the 
groups. The expected value for the denominator for tests on effects which 
are not confounded with main effects due to subjects within the groups has 
the form 


mee 
0; + Spa» 


where o5, is the interaction between the subject and treatmentfactors. The 
magnitude of o7, is generally considerably smaller than o2. (It should be 
noted that 62 includes more sources of variance than does о.) 

Repeated measures оп the same elements may arise in different ways. Іп 
experiments designed to study rates of learning as a function of treatment 
effects, repeated measures on the same subject are a necessary part of the 
design. Further, the order in which the observations are made is dictated 
by the experimental variables. On the other hand, in experiments designed 
to evaluate the joint effect of two or more treatments the experimenter may 
have his option as to whether or not the same elements are observed under 
more than one treatment combination. Further, the order in which the 
elements appear under the treatment combinations may also be under the 
control of the experimenter. The utility of designs calling for repeated 
measures is limited where carry-over effects are likely to confound results. 
In Some cases such effects may be controlled by counterbalancing the order 
in which treatment combinations are given to the elements. (Designs 
of this kind are discussed in Sec. 10.7.) 

Aside from designs having the form of learning experiments, the primary 
purpose of repeated measures on the same elements is the control that this 
kind of design provides over individual differences between experimental 
units. In the area of the behavioral sciences, differences between such units 
often are quite large relative to differences in treatment effects which the 
experimenter is trying to evaluate. If there are no carry-over effects, a 
repeated-measures design in the area of the behavioral sciences is to some 
degree analogous to a split-plot design in the area of agricultural experi- 
mentation. Where additive carry-over effects are present, repeated-meas- 
ures designs are analogous to crossover designs. ; 

Another (somewhat doubtful) advantage of a repeated-measures design is 
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in terms of economy of subjects. Using different subjects under each of the 
treatment combinations in a factorial experiment has the marked advantage 
of providing statistically independent estimates of treatment effects from all 
cells in the experiment. Increasing the number of Statistically independent 
observations is very likely to be the best way of increasing the precision of 
estimators. By having each subject serve as his own control, the experi- 
menter attempts to work with a smaller sample size. However, the simple 
additive model underlying the usual analysis for the case of repeated 
measures may not be an adequate representation of the experimental phe- 
nomena. A more inclusive multivariate regressionmodel is often required 
to represent fully the underlying experimental variables. 

In sections that follow, special cases of factorial designs having repeated 
measures will be illustrated. The expected values of the mean square for 
these special cases will be given. The latter are obtained by means of the 
methods given by Cornfield and Tukey (1956). The validity of the tests 
Set up on the basis of such expected values rests upon assumptions about 
the form of the variance-covariance matrix associated with the joint multi- 
variate normal distribution of the variables in the model. Unless the 
nature of the experimental variables dictates the order in which treatments 
are administered to subjects, it will be assumed that the order of administra- 
tion is randomized independently for each of the subjects. Further, it will 
be assumed that the elements in a group are a random sample from a 
specified population of elements. 

A strong word of warning is required in connection with order (or se- 
quence) effects. Practice, fatigue, transfer of training, the effects of an 
immediately preceding success or failure illustrate what fall in the latter 
category. If such effects exist, randomizing or counterbalancing does not 
remove them; rather, such procedures completely entangle the latter with 
treatment effects. There is some chance that sequence effects will balance 
out—they generally will if a simple additive model is realistic. However, 
in experiments (other than those which primarily are concerned with 
learning or carry-over effects) where the sequence effects are likely to be 
marked and where primary interest lies in evaluating the effect of individual 
treatments in the absence of possible sequence effects, a repeated-measures 
design is to be avoided. 4 

In cases where sequence effects are likely to be small relative to the 
treatment effects, repeated-measures designs can be used. Counter- 
balancing or randomizing order of administration in this case tends to 
prevent sequence effects from being completely confounded with one or just 
a selected few of the treatments. Instead such sequence effects are spread 
Over all the treatment effects. Admittedly such sequence effects may serve 
to mask treatment effects; however, the potential advantages can outweigh 
the potential disadvantages. 
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72 Two-factor Experiment with Repeated Measures on One Factor 


This kind of experiment was illustrated in the last section. The general 
case may be represented as follows: 


b, TM Ау, E b, 
ау G, yu бі %1” бі 
Ae хе сал Ө РГЕ а СА 
ШІ! берү» с, 


Each С represents а random sample of size п from а common рорша- 
tion of subjects. Each of the subjects in G; is observed under 4 different 
treatment combinations, all of these treatment combinations involving 
factor А at levela; The actual observations on the subjects within group / 
may be represented as follows: 


Subject by ee b; cere b, 
1 Зоо pt e Cr qae 7 
а; k Хақ: Хик сезе Хак 
n am e гай Хат е Xin 


The symbol Хик denotes a measurement оп subject k in С, under treatment 
combination ab;. А more complete notation for subject k would be K(i); 
the latter notation distinguishes this subject from subject k in some other 
group. Similarly a more complete notation for Y;;, would be Хуу. The 
latter notationis rather cumbersome; in cases in which there is no ambiguity, 
the symbol Ху will be used to indicate an observation on subject К in G; 
made under treatment combination ab,,. 

For special cases the notation for a subject may be made more specific. 
For example, consider the case in which p=2,q = 3, andn=2. The 
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experimental data may be represented as follows: 


| Subject b, by by 
a | 1 Xu Xi Xii 
2 12 Xiz 132 


S 
ж 


In this notation scheme subject 3 is the first subject in б, and subject 4 is 


the second subject in б,. 
The linear model upon which the analysis will be based has the following 
form: 


LA | 
Хк = U + 08 +T + By + «у + Bri + Erin 


The notation т, indicates that the effect of subject is nested under level 
а. Note that the linear model does not include any carry-over effects. 
The analysis of variance for this kind of experiment takes the form given in 
Table 7.2-1. The expected values shown in this table assume that 4 and В 
are fixed factors. 

Table 7.2-1 Summary of Analysis of Variance 


Source of variation df E(MS) 

Between subjects пр -1 

А p-1 оё + qo, + ngoa 

Subjects within groups рп = 1) аё + 491 
Within subjects тр(4 - 1) 

В 4-1 92 + ой, + про) 

АВ (р = 09 = 1) a2 + of, + по 

B x subjects within groups рп — 104—1) | È +98, 


a 


The manner in which the total variation is partitioned in this table is 
quite similar to that used їп ар x q factorial experiment in which there are 
No repeated measures. A comparison of the two partitions is shown in 
Table 7.2-2. It will be noted that partition of the between-cell variation 
is identical. However, in an experiment having repeated measures, the 
within-cell variation is divided into two orthogonal (nonoverlapping) parts. 
One part is a function of experimental error plus the main effects of subjects 
within groups, i.e., individual differences. The other part is a function of 
experimental error and В х subject-within-group interaction. If the latter 
interaction is negligible, then the second part of the within-cell variation is a 
function solely of experimental error. 
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Table 7.2-2 Comparison of Partitions 


p х q factorial р х q factorial 
(no repeated measures) (repeated measures on factor B) 
Total npq -1 Total npq —1 
Between cells ра д Between cells pq—1 
A jp A mad 
B 1 В 4-1 
БАН (p =1Xg:=1) AB PDN 
Within cells рап — 1) | Within cells pan — 1) 
Subjects within 
groups p(n — 1) 
B x subjects within 
groups pn — 1)(4 —1) 


Appropriate denominators for F ratios to be used in making statistical 
tests are indicated by the expected values of the mean squares. Thus, to 
test the hypothesis that 62 = 0, 


MS, 


MSsubj w. groups 


The mean square in the denominator of the above F ratio is sometimes 
designated Мб оь wetween) То test the hypothesis that o5 = 0, 


MS, 


3 М5в x subj w. m 
To test the hypothesis that c2, = 0, the appropriate F ratio is 
5 MS 


MS; x subj w. groups 


ab 


The mean square in the denominator of the last two F ratios is sometimes 
called MS, (уныш) Since it forms the denominator of F ratios used in 
testing effects which can be classified as part of the within-subject variation. 

The mean squares used in the denominators of the above F ratios repre- 
sent a pooling of different sources of variation. The variation due to sub- 
jects within groups is the sum of the following sources of variation: 


Source df 
Subjects within groups p(n — 1) à 
Subjects within G, T 
Subjects within G, шр 
Subjects within G, е! 
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One of the assumptions required in order that the F ratio actually follow 
an F distribution is that these sources of variation be homogeneous. A 
partial check on this assumption may be made through use of the statistic 
maximum (SSsupj w. @,) 
Fmax = Sa CP MEI 
minimum (SSsupj w. с) 
i.e., the ratio of the largest of these sources of variation to the smallest. The 
critical value for this statistic in a test having level of significance equal to « 


1S 
Fmaxq (р, n — 1). 


These critical values are given in Table B.7. 
The variation due to B x subjects within groups represents a pooling of 
the following sources: 


Source df 
B x subjects within groups pi — Ig — 1) 
B x subjects within G, (n - 1/4 — 1) 
В x subjects within б, (п — 1/4 - 1) 
В x subjects within G, (п — Ig = 1) 


A test on the homogeneity of these sources is given by 


maximum (SSz x subj w. G,) 


Fmax 
minimum (SS; x subj w. б,) 


The critical value for this test is 
Fimaxca—a)Lp, (1 — 1)(4 — 1)]. 

If the scale of measurement for the original criterion data does not satisfy 
these homogeneity assumptions, a transformation may often be found which 
will Satisfy these assumptions. Indications are, however, that the F tests 
given above are robust with respect to minor violations of these assump- 
tions [see Box (1954)]. E = 

For the sampling distribution of the F ratio for within-subject effects to 
be the F distribution with the usual degrees of freedom requires additional 
assumptions about the pattern of elements іп q X 4 covariance matrices. 
(The latter covariance matrices appear in a later section of this chapter.) 
A method for making this check is illustrated in the work of Greenhouse 
and Geisser (1959); a numerical example is given in Sec. 7.7. These 
workers also provide a simple approximation procedure which avoids 
assumptions about equal covariances in the pooled variance-covariance 
matrix. The approximation procedure given by these writers errs on the 
Side of making the critical value somewhat larger than should be the case. 
(Hence the test procedure is negatively biased, since it will yield too few 

Significant" decisions.) 
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In the Greenhouse and Geisser procedure, the F ratios given above are 
used, but the degrees of freedom used in finding the critical values are 
adjusted. To test the hypothesis that o7 = 0, the critical value for the F 
ratio is 

F,_,[1, p(n — 1)] instead of.  F,_,[(q — 1), p(n — 1)(q — 1)]. 
To test the hypothesis that ož; = 0, the critical value is 
Е,_„[(р — 1), p(n —1)]  insteadof Р,_„[(р — 1)(4— 1), p(n — 1)(4 — 1)]. 


Use of these modified critical values is recommended if there is reason to 
doubt that the covariances between pairs of observations are constant 
for all levels of factor B. Transformations on the scale of measurement 
which yield homogeneity of the error terms which are pooled do not neces- 
sarily yield the required homogeneity conditions on the covariances. 
Computational Procedures. To illustrate the computational procedures 
for this plan, consider a factorial experiment in which the levels of factor A 


Table 7.2-3 Numerical Example 


Observed data: 


Subject | b, by bs b, Total 
1 0 0 5 3 8 =P, 
a 2 З 1 5 4 13 = P, 
, В 4 3 6 2 15 = P, 
®© 
4 4 2 7 8 21 — P, 
а; 5 5 4 6 6 21=Р, 
6 7 5 8 9 29 — P, 
107 =G 
Баа адланыр анчир анн 207 — 
AB summary table:. 
by by bs b, Total 
(ii) a T 4 16 9 36 
ag 46) ESDI 1:23 71 
234171514 3678 099/32. 107 
eee ee алым сз з... 
Computational symbols: 
(1) = G*/npq = (107)?/3(2)(4) = 477.04 
Q) = Ух? =0 +0454... 482492 = 615 
did (3) -(“А т = (36° + 71°)/3(4) = 528.08 


(4) —GBj)n = (23° + 15° + 37° 329/30) = 524.50 
(5) = С204В,)]п = (7? + 4? + --- + 212 + 233)/3 = 583.00 


(6) —CPD|g = (8° + 13? + --- + 21? + 292)/4 = 545.25 
a E i vi 
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are two methods for calibrating dials and the levels of B are four shapes 
for the dials. Suppose that the data obtained are those given in part i of 
Table 7.2-3. Entries are accuracy scores on a series of trials on each of the 
dials. Thus, for this experiment p = 2, д =4,andn = 3. The order in 
which subjects are observed under the dials is randomized independently. 

From the data in part i, the AB summary table given in part ii is readily 
obtained. Computational symbols are defined and computed in part iii. 
The only symbol that does not occur ina р X q factorial experiment with- 
Out repeated measures is (6). This latter symbol involves Р,, which is the 
sum of the q observations made on subject k. These sums are given at 
the right of parti. In each case the divisor in a computational symbol is 
the number of observations summed to obtain an entry which is squared 
in the numerator. 


Table 7.2-4 Analysis of Variance for Numerical Example 


Source of variation Computational formula | SS | df MS F 
Between subjects - Хд (6) — (1) = 68.21 5 
A (calibration) | (3) - (1) = 51.04) 1] 51.04 | 11.90 
Subjects within groups (6 — (3) = 17.17 | 4 4.29 
Within Subjects (2) — (6) = 69.75 18 
B (shape) (4) - (1) = 47.46] 3 | 15.82 | 12.76 
AB (5) - (3) -(4)--(1) = 746| 3| 249 | 201 
2 1.24 


Bx subjects within groups (2) — (5) — (6) + (3) = 14.83 | 1 
group: 


The analysis of variance is summarized in Table 7.2-4. In terms of 
means, 


SSsunj w. groups = «ХХ =, Ay, 
кі 
SSp x subj w. groups = УУ У(Х == Ри) = В, al A)’. 
kit 


The computational formulas given in Table 7.2-4 are equivalent to these 
Operations on the means. 

The differential sensitivity for tests on between- and within-subject effects 
should be noted, The denominator of the F ratio for the between-subject 
effects is 4.29, whereas the denominator of the F ratios for the within- 
Subject effects is 1.24. It is not unusual in this kind of plan to find the 
Tatio of these two denominators to be as large as 10:1. If the level of 
Significance for tests is set at .05, one rejects the hypothesis that о? = 0, since 
Fos(1,4) = 7.71; the hypothesis that оў — 0 is also rejected, since 
Е,(3,12) = 3.49. However, the experimental data do not contradict 
the hypothesis that 022—0: 

If the experimenter has reason (often there is reason) to question the 
homogeneity of the covariances in the underlying population, the critical 
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values for the within-subject tests are as follows: 
eee eee 


Hypothesis | Conservative test | Ordinary test 
оў =0 Е gs 1,4] = 7.71 Е 933,12] = 3.49 
оёр =0 Е gj1,4] = 7.71 Е 0513,12] = 3.49 


iin Baht а oa ads ec Oe ? 
Even under the more conservative test (i.e., negatively biased test) the main 


effects due to the shapes of the dials remain statistically significant. 

If the experimenter has reason to question the homogeneity of the parts 
that are pooled to form the denominators of the F ratios, a check on homo- 
geneity would logically precede the tests. Computational procedures for 
partitioning the relevant sums of squares are given in Table 7.2-5. А sym- 
bol of form (6a,) has the same general definition as (6), but summations 


Table 7.2-5 Partition of Error Terms 


pl 
(6а) = (EPI = (8° + 13? + 152)/4 — 114.50 
(баз) = (УР = (21? + 212 + 29%)/4 = 430.75 
(6) = 545.25 
(За) = (Ap)/ng = 36°/3(4) = 108.00 
Gay) = (43)/nq = 71*[3(4) = 420.08 
н (3) = 528.08 
(i) : 
ба) = [Ж(АВ,у°]п = [7° + 42 + 16? + 923 = 13400 
в 
(баз) = [У(АВ„у°]п = [16° + 112 + 212 + 2321/3 = 449.00 
Е (5) = 583.00 
Qa) = XX* = 02 + OF 4... 62 42 = 150 
Qa) = EX? = 42 22... gh 92 = 465 
3 (2) = 615 
SSsupjw.@, = (бау) — (3a,) = 6.50 
SSsubj w, с, = (баз) — (3a;) = 10.67 
(ii) Km ai 


1747 
SSB xsubj w. с, = (а) — (бау) — (бау) + (3a,) = 9.50 
553 xsubj w. д, = (2а) - (Say) — (6а) + (3а) = 5.33 


14.83 


к= эз == = „ыыы „АА 


are restricted to level а. The computational procedures for parts which 
are pooled are given in part ii. As a check on the homogeneity of 
55,4 w. groups» 

6.50 — 
The critical value for a .05-level test here is 


Fmax. „(2,2) = 39.00. 
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Since the computed Fmax statistic does not exceed the critical value, the test 
does not contradict the hypothesis that the parts are homogeneous. Since 
each of the parts in this case has only two degrees of freedom, the power 
of a test of this kind is extremely low. 

As a check on the homogeneity of the parts of SS, „ 


max = = 1,84, 


5.33 
Fmax „(2,6) = 5.82. 


Again the computed value of the statistic does not exceed the critical value. 
Hence the hypothesis of homogeneity is not contradicted by the experi- 
mental data. 

Tests on the difference between all possible pairs of means can be made 
in a manner similar to that given in Sec. 3.8. The procedure is illustrated 
in Table 7.2-6. The Newman-Keuls method is chosen for this purpose. 


subj w. groups? 


The critical value here is 


Table 7.2-6 Tests on Means Using Newman-Keuls Procedure 


Shapes .... by by b, bs 
Ordered means.... | 2.50 | 3.83 | 5.33 | 6.17 
(i) by b, b, bs 
D 1.33 2.83 3.67 
Differences bi 1.0 234 
between pairs 0, 84 
зр = 46 peo 3 4 
Gi) 9.4012): 308 3.77 420 
ғ 9.12): 12173193 


(ii) ^ qoe 
ы ж ж 
b, T 


In part i the B,’s are arranged in rank order from low to high. Differences 
between all possible pairs of ordered means are computed. For example, 


6.17 — 2.50. — 3.67, 5.33 — 2.50 — 2.83, etc. 


In part ii critical values for the ordered differences between pairs are com- 
Puted. Since the main effect of factor B is a within-subject effect, the 
Standard error of the mean for all observations at a given level of factor B is 


ee UE x subjw. groups __ и w J207 Sis 
2 пр 6 
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The degrees of freedom associated with this standard error are those of 
MS; x subj w. groups, Which in this case are 12. To obtain the critical value 
for the difference between two ordered means which are r steps apart in an 
ordered sequence, one first finds the tabled values for 


91-27, dferror), 


where df error represents the degrees of freedom associated with Sp. These 
values are obtained from Table B.4. For level of significance .05, the 
relevant values of q are given in part ii. The critical value for an ordered 
difference betwween two means ғ Steps apart is 


5р41-а(ғ,Ф ог). 


These critical values also appear in partii. (For the sequence in which tests 
on ordered pairs must be made, see Sec, 3.8.) 

The pairs of means which can be considered different are indicated in part 
iii. The mean performance оп shape №, is statistically different from the 
mean performance on shapes b, and b, The mean performance on shape 
b, is also statistically different from the mean performance on shapes 8; 
апа b,, No other differences are Statistically significant at the .05 level for 


Tests on all possible ordered differences of the form 4, — А, follow the 
Same general pattern. For such tests, 


зд= ГМ groups м 
nq 


the variation due to the simple main effect of factors 4 and B is identical to 
that of a two-factor factorial experiment which does not have repeated 


Promien To test the simple main effect of factor B, the F ratio has the 
orm 


ра М5, ata, 
сыч ad i адыр 


М5, X subj у, groups 


factor A has the form 


М3, сеп : 


The denominator of this last F ratio Tequires special note—it is not the 
denominator used in testing the main effects of factor 4. For each level of 
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factor В considered individually, this plan reduces to a single-factor experi- 
ment in which there are no repeated measures. In this latter type of experi- 
ment М5, .. is the appropriate denominator for the variation due to the 
treatment effects. 

The within-cell variation is given by 


SSw. cell = SS subj w. groups + 55в x subj w. groups: 


Within the context of a repeated-measures design, SS, en represents a 
pooling of what will often be heterogeneous sources of variance. Hence 
the F test on the simple main effects for factor A, which uses MS,,, cn as a 
denominator, will tend to be biased. However, when the degrees of free- 
dom for the within-cell variation are large (say, greater than 30), the bias 
will be quite small. The magnitude of the bias depends in part upon the 
ratio of MS, w. groups to MS; x subj w. groups. 

Variation due to the simple main effects is most readily computed from 
the АВ summary table given in part ii of Table 7.2-3. For example, 


MSa nto, = = = 18:50: 
p 


The denominator for the appropriate F ratio is 
SSw.cen _ 17.17 + 14.83 _ 5 99 


MSy, cell 


pq(n—1) 16 
TE М5ашь _ 6.75. 
MS. сеп 


The degrees of freedom for this F statistic are [(р — 1), рд(л — 1)]. The 
Variation due to the simple main effects for factor B at level а; is 


2 2 
P4 16 +9 36 _ 26,00; 


SSpata, 


3 12 
26.00 
MSpata, = SSpata, _ 26.00 _ 9.67. 
"ese 3 
A test on the simple main effects of factor B at level a, uses the statistic 
М5, ata, _ 8.67 _ 6.99. 


MSz x subjw.groups 1.24 
The critical value for a test having level of significance .05 is 


F [04 — 1), p(n — 1)(4 — 1)] = F 3,12) = 3.49. 


312 MULTIFACTOR EXPERIMENTS HAVING REPEATED MEASURES 


Hence the experimental data tend to reject the hypothesis that there are 
no differences in the effects of factor B when all observations are made at 
level a,. 


Covariance Matrices Associated with this Design. Consider the follow- 
ing data from part i of Table 7.2-3 


Subject | b, by bs b, 

1 0 0 5 3 

а, 2 3 1 5 4 

3 4 3 6 2 

7 4 16 9 
c—————À o ETT Y RE 


The variance of the observations made under Б, is 
2 2 2). (432 
vary, — (+ 3° + 42) — (77/3) 
2 
Similarly the variance of the Observations made under b, is 
2 2 2 2 
var, = © + 1 + 35) — (47/3) 
2 
The covariance of the observations made under b, and Б, is 


соу, = (0X0) + (3)(1) ze» = (X43 _ 


4.33. 


= 2.33. 


2.83. 


Similarly, the covariance of the Observations under b, and b; is 


cov, = (0X5) + (3)(5) 7 (4)(6) — (7)(16)/3 = 83. 
The variance-covariance matrix for le 


€ vel a, of factor A is given below (only 
the top half is completed since the bot 


tom half is a duplicate of the top half): 


b, b, 8 by 
БҮ 74935 95195 153 03-0650 
в, | bs 2.33 83 100 
bs 85 — — 59 
b, 1.00 


nce matrix for the data at leve] a, is given below: 


b, by bs b, 
СИ АЙНИ Ree 
в, | 5 2.33 150 33 
zu Beg 100 150 
b, 


2:33 
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In order that the F ratio on within-subject effects follow an F distribution, 
the population variance-covariance matrices must have the following form: 


|Ж b, bs b, 
bi а? ра? ро? ро? 
bs а? ра? ра? 
bs өз ро? 
b, | o 


In words, each of the elements along the diagonal is equal to 07, and each of 
the covariances is equal to po?. It is further assumed that о? and p are the 
same for all levels of factor A. In this context p is the product-moment 
correlation between pairs of measurements on the same subject. 

If the underlying population variance-covariance matrices for В, and 
В, are equal but not necessarily of the form indicated above, the best esti- 
mate of the common underlying population variance-covariance matrix is 
the pooled sample matrices. The pooled matrix is obtained by averaging 
corresponding entries in the individual matrices. For the data under con- 
sideration, the pooled variance-covariance matrix is given below: 


b, by bs b, 

By. | 13,33,2250. 92 33 

B bs 2.33 67) = 38 
poole py 67 150 
by 1.67 


There are two stages in testing for homogeneity with respect to the 
variance-covariance matrices. These tests are illustrated in Sec. 7.7. 
First, one is interested in finding out whether or not ће B,,’s can be pooled. 
Second, if these matrices can be pooled, one is interested in finding out 
whether or not the population matrix estimated by the pooled matrix has 
the required symmetry, i.e., all diagonal elements equal to o°, and all off- 
diagonal elements equal to po?. In terms of the pooled variance-covariance 
matrix, 


Мӛн x subj ж. groups — var SCN 
For the data under consideration, 
Var — cov = 2.00 — .76 = 1.24. 


The term var is the mean of entries along the main diagonal of the pooled 
Variance-covariance matrix; cov is the mean of the entries off the main 


diagonal in this matrix. Also, 


var + (д — 1) cov = 2.00 + 3(.76) = 4.28. 
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From Table 7.2-4 it is noted that 
MSsubjw. groups = 4.29, 
MS x subj w. groups = 1.24. 
The computational formulas for these terms, which are given in Table 7.2-4, 
are thus short-cut methods, The long method, however, indicates what has 
to be pooled along the way; the long method also produces summary data 


that are useful in interpreting the experimental results, 
Illustrative Applications. Many applications of this form of the 


task. The criterion score was the number of correct symbols recognized in 


a series of very short exposures. The plan of this experiment may be repre- 
sented as follows: 


SSS > ла 


| Dynamometer tension 
Anxiety level 


By | ty De x 


ay G, б, б, G, G, С, 
а G, G, G, 2 б; G, 
а G G G G 6; о, 


Each of the Subjects was tested under all levels of dynamometer tension. 


df MS F 
Between subjects 17 
ee zt 
A (anxiety level) 2 742.7 1.58 
Subjects within groups 15 469.0 
Within subjects 90 
Sees кз 
В (tension) 5 138.7 3.3974 
AB 7 10 127.6 З12** 
B x subjects within groups 75 40.9 
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The significant interaction indicated that the profiles for the groups had 
different shapes. Shore interprets this interaction in terms of the group 
profiles. Note the magnitude of the ratio of the between-subject error 
term (469.0) to the within-subject error term (40.9). A ratio of 10:1 is not 
unusual in this area of experimentation. 

Another application of this same basic experimental plan is reported by 
Noble and McNeely (1957). This study was concerned with the relative 
rate of paired-associates learning as a function of degree of meaningfulness 
of the pairs. The experimental plan may be represented as follows: 


Degree of meaningfulness 


List 

by ba 8 b, bs 
a G, G, бі бі бі 
а б, б, б, б, б, 


aig Gig біз Gis Gis б 


Each of the 18 lists had pairs for each of the meaningfulness categories. 
There were five subjects in each of the groups. Differences between groups 
of subjects are confounded with differences between the lists; however, 
primary interest is in the meaningfulness dimension of this experiment. 
The lists essentially represent samples of material to be learned. The 
analysis of variance had the following form: 


df MS F 
Between subjects 89 
A (lists) 17 311.88 1.20 
Subjects within groups 72 260.34 
Within subjects 360 
B (meaningfulness 4 3046.72 | 52.72** 
AB È | 68 82.91 1.43 
B x subjects within groups 288 57.79 
ES oo ooo арс ЗА ыы с. 


Тһе analysis clearly indicates that degree of meaningfulness of the paired 
Associates is related to the rate of learning. Р ) 

А study reported by Schrier (1958) provides another illustration of the use 
Of this basic plan. In this study the experimenter was interested in evaluati ng 
the effect of amount of reward on the performance of monkeys in a series 
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of discrimination problems. The criterion used was per cent of correct 
choices in two periods—a period was a block of 20 consecutive trials. The 
plan of the experiment may be represented as follows (only one phase of a 
more extensive experiment is summarized here): 


аал 53... 
Period 
Level of reward 
by b, 
a бі G, 
а б, б; 
аз бз бз 
а G, G, 
cr e aem эмы vun Mi cii 


There were five subjects in each group. 

In designs in which there are only two measurements on eachsubject (and 
the order in which the measurements are made is not at the option of the 
experimenter) there is only one covariance. Hence the problem of homo- 
geneity of covariance does not exist. The analysis of the within-subject 
effects in this case is equivalent to an analysis of difference scores between 
the two periods, 

A scale of measurement in terms of рег cent generally does not provide 
homogeneity of variance, Schrier used an arcsine transformation on the 
Percentages, and the analysis of variance was made on the transformed 
criterion scale. А modified version of the analysis is given below: 


df 


MS Е 
Between subjects 19 Т 
А (rewards) °З 148.6 2.52 
Subjects within groups 16 58.9 
Within subjects 20 
B (periods) 1 685.8 59.12** 
АВ 3 1.9 
B x subjects within groups 16 1.16 


i, lee M 


this chapter. 


Another example of the same basic experimental plan is reported by 
Denenberg and Myers (1958). This Study was designed to evaluate the 
effects of thyroid level upon acquisition arid extinction of a bar-pressing 
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response. Schematically the acquisition phase of this study had the 
following form: 


Days 
Thyroid level 
КЕ bys 
| 
а | G G, G, 
а Gz G: б, 
аз Gz б, Gs 


There were four subjects in each of the groups. The criterion measure was 
the number of responses in a Skinner box during a specified time period. 
The analysis of variance had the following form. 


df MS Е, 
Between subjects п 
A (thyroid level) 2 56,665 13,324* 
Subjects within groups 9 4,253 
Within subjects 204 
B (days) 17 594 249** 
AB 34 1,190 4.98** 
B x subjects within groups 153 239 


Again note the ratio of the between-subject error term (4253) to the within- 
subject error term (239). f 

Birch, Burnside, and Clark (1958) report an experiment which had the 
following form: 


Blocks of trials 


Deprivation |. 
interval ы be bs 
F 5 б, б, б, т = 10 
& б» бі СЕ 4-10 
а, б; б; б; пз = 10 
а G, б, С, Tai 9 


There were 10 subjects in each of the groups except бі; there were 9 subjects 
in the latter groups since 1 subject had to be discarded in the course of the 
experiment, 
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Speed of running was used as measure of response strength. The study 
was designed to investigate response strength as a function of deprivation 
interval. The analysis of variance had the following form: 


df MS F 
Between subjects 38 
A (deprivation interval) 3 788.06 3.34* 
Subjects within groups 35, 236.29 
Within subjects 78 
B (trials) 2 | 79.92 1.17 
AB 6 56.70 
B x subjects within groups 70 68.24 


———————MM M ÁÉÁÉ dL É BÉ 


In cases of missing data of this kind, where the missing data are the result of 
conditions unrelated to the treatments, estimates of variation due to treat- 
ment effects should be obtained through use of an unweighted means 
analysis. 

General Form of the Expected Values of the Mean Squares. Following 
the procedures described by Cornfield and Tukey (1956), the expected values 


for more general cases of the plan under consideration are given below. In 
the notation used, 


= р 4 п 
D. d реу nem 
2 Р Q Hs N 


Thus each of the D's is either 0 or 1, depending upon whether the corre- 
sponding factor is fixed or random, respectively. 


zoa o e ril КШ 


Effect i і k m Expected value of mean square 

0; р, q n 1 о? + Dn о), + nD4025 4-40,03 + то 
ШҮ 1 4 D, í ор + DO, + qD,o? 

Ё; рако 1 of + Род, + nD,o25 + проў 

«В; Dy, Dy n 1 оў + Род, + n025 

Вто І Di. р. 1 оў + Род, 

mk) 1 1 1 1 вр 


——————Áu— MM 


The expected values for the mean Squares for the case in which factors A and 


B are fixed and the subject factor random are obtained by setting D, — 0, 
D, = 0, and D, = 1. 
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7.3 Three-factor Experiment with Repeated Measures (Case I) 


Two special cases will be considered. The first case will be that of a 
p Xq X r factorial experiment in which there are repeated observations on 
thelast two factors. Inthe second case, repeated measures will be restricted 
to the last factor. A schematic representation of the first case is given 
below: 


| by b, 

| е €, сі с, 
бі б, Үү, б, р G, T бі 
az б, WA Gy Ye б, 47% Gy 
а, G, in G, he б, S G, 


There are n subjects in each group. Each of the subjects is observed under 
all gr combinations of factor В and C but only under a single level of factor 
A. Thus there are p groups of л subjects each (np subjects in all); there are 
qr observations on each subject. 

The observations on subjects in group i may be represented as follows: 


Ы NES b, 
Subject | —— Total 
сі 57 с, ү o с, 
10) Жазар <- Xan ЭТА. ER Akari Pin 
а; m(i) ОИСИ At о Xam 777 Хат Раш 
ni) Xin 54 Жан quis Xian °° Xam Prò 
a Aer a ит зш et сш. 


ттт indicates ап observation on subject m(i) under treat- 

Ment combination abc;,. The notation Pc denotes the sum of the qr 

Observations on subject m in group i. Unless there is some ambiguity 

а which group is under discussion, the notation P,, will be used to denote 
S total. 


The notation Y, 
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Assuming that A, B, and C are fixed factors, the analysis of variance 
generally takes the form shown in Table 7.3-1. Тһе expected values of the 
mean squares indicate appropriate F ratios, provided that the sampling 


Table 7.3-1 Summary of Analysis of Variance 


————————————————————á 


Source of variation df E(MS)t 
Between subjects тр —1 
A р-і оў + qro? + nqro 
Subj w. groups pn — 1) оў + qro® 
Within subjects np(qr — 1) 
B 4-1 92 + ғай, + пргоў 
AB (р —1)(4—1) о? + ғо, + тоў 
B x subj w. groups р(п = Dq — 1) о + ға, 
с r—1 оў + 40%, + прдо? 
АС (р = Dq – 1) о? + да, + ngo?, 
С х subj w. groups pn — Mr — 1) of + qoi, 
BC (9 = 00" — 1) of + oF, + прод, 
ABC (p — Ig — Dr — 1) of + of, + nodo, 
BC subj w. groups p — 1g — Ir — 1) 9; + Oh. 


T Assumes A, B, and C fixed factors. 


distributions of the statistics involv 
be. What is assumed will be mad 
An alternative partition of the 


ed are actually what they are assumed to 
е explicit later in this section. 
total degrees of freedom is as follows: 


Га EE eee 


Degrees of freedom 


Source of variation 


General case 


Special case 
р= 2,9 = 3,ғ = 4,п = 5 


Between cells 
Within cells 
Subj w. groups 
B x subj w. groups 
C x subj w. groups 
BC x subj w. groups 


A cell in this context contains 


bination abcr. 


par — 1 

рға - 1) 
p(n — 1) 
pi — 1)(4 — 1) 
р(п — Yr — 1) 


po — 1)(9 — 1r — 1) 


23 
96 


the n observations under treatment com- 
There аге pqr cells. Treatment and interaction variation 


are obtained by partitioning the between-cell variation. The latter parti- 
tion is identicalforallp x q x r factorial experiments whether or not there 
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are repeated measures. The manner in which the within-cell variation 
is partitioned depends upon the pattern of the repeated measures. For the 
case being considered, each of the parts of the within-cell variation forms a 
denominator for some F ratio. Hence the following alternative notation 
system: 

MSsupj w. groups = MSerror(a), 


MSz x subj w. groups = MSerror(6); 
MSc x subj w. groups = MS error(e), 


MS nc x subj w. groups = MS егото): 


The alternative notation system has the advantage of being more compact 
as well as more indicative of the role of the error term in the F tests, for the 
special case being considered. However, the F tests will change as a func- 
tion of what model is appropriate for the data being analyzed. Thus the 
alternative notation has the disadvantage of not indicating how each of the 
terms is computed. 

Each of the parts of the within-cell variation may be checked for homo- 
geneity, the latter being one of the assumptions made when the F ratio is 
considered to be distributed in the form of the F distribution. Forexample, 


Subj w. groups pin = 1) 
Subj w. G, nzi 
Subj w. G, n—1 


An Ку test for the homogeneity of the parts would have the critical value 


max 


Баха), п — 1). As another example, 


BC x subj w. groups рп = DG — 0t = 0 
. BC x subj w. G, (n — 1)q — Dir — 1) 
BC x subj w. G, (п — 1g — Dr = 1) 


max 


variances of a set of variance-covariance matrices are equal. This test 
is only a partial check on whether or not sets of corresponding variance- 
covariance matrices may be pooled. More appropriate tests are generaliza- 
tions of those given in Sec. 7.7. 

А Each of the interactions with subjects may be shown to have the general 
orm 


An F,,.. test in this last case is, in part, a check on whether or not mean co- 


var — COV; 


with respect to a specified variance-covariance matrix. For example, 
Corresponding to the BC x subjects within groups interaction thereis a 
pooled рд x pq variance-covariance matrix. This matrix is assumed to be 
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an estimate of a population variance-covariance matrix in which the ele- 
ments along the main diagonal are all equal to o? and the elements off the 
main diagonal are all equal to p,o?. For each of the variance-covariance 
matrices, o? is assumed to be constant, but p may vary for the differ- 
ent matrices; that is, p is constant within any one matrix but varies for 
different matrices. If p were constant for all matrices, then дон = 00. 
and all interactions with subjects could be pooled. 

If the complete model is appropriate and if there is evidence to indicate 
that the assumptions with respect to the covariances do not hold, the critical 
values listed under the heading Conservative Test are more realistic than 
those given under the heading of Usual Test. 


ee 


Critical values 
F ratio AE 
Usual test Conservative test 
MS, m 
Маа SEES Fy—al(p — 1), p(n — 1)(4 — 1)] Е 1, p(n — 1) 
MS,» 
IEEE Fy—al(p — 1)(4 — 1), р(п — 1Х4- 1) Fy —ol(p — 1), pn — 1)] 
MS, 
Маш Finale — D. p — De — 1] Fi —all, p(n — 1) 
MS, 
MS pr ену Fy—al(p — 1)(r — 1), p(n — 1)(r — 1] Fi-al(p — 1), p(n — 1) 
ШЕВ 
Марсан еу Fy—al(q — 1r — 1), p(n — 1)(4 — 1r — 1)] Еу — Ml, p(n — 1) 
MS, 
МБрох нур жалар 7 1-740 — Da DE — D, pn — DG — De — D) | Fi ар — 1), p — 01 
E E a A T 


The expected values for mean squares given in Table 7.3-1 are obtained 
from a model which includes interactions with the subject factor. If in 
fact such interactions do not exist (or are negligible relative to the magnitude 
of o2), then 

E(MS eroro) — o? if 02-30) 

E(MS error) = o? — if eias 0, 

EMS error) = o? if of, = 0. 
In words, if all interactions with the subject factor are zero, each of the 
above mean squares is an estimate of the same variance, namely, that due to 
experimental error. Further, these estimates are independent and may be 
pooled to provide a single estimate of o2. Thus, 
SSerror(t) + SSerror(ey + SSerror(te) 

p(n — 1)(qr — 1) 

provides an estimate of c? having p(n — 1)(gr — 1) degrees of freedom. 


MSerror(within) = 
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If the experiment provides a relatively large number of degrees of freedom 
(say, over 30) for estimating the variance due to each of the interactions with 
subjects, there is generally no need to consider pooling procedures. When 
there are relatively few degrees of freedom for such estimates, the decision 
about pooling should depend largely on previous experimental work. In 
the absence of such background information, preliminary tests on the model 
are useful. The purpose of such tests is to provide the experimenter with a 
posteriori information about whether or not certain of the interactions with 
random factors should be included in the model for the experiment. Such 


Table 7.3-2 Denominator of F Ratio for Simple Effects 


Simple effect Denominator of F ratio 
A at b; АВ, — АВ; [MSerror(ay + (4 — 1)MSerror (01/9 
A at cy WG p SE [MSerror (а) + (r — 1)MSerror (0)/^ 
B at a; ABa — АВ, MSerror (0) 
C at aj АСд — АС, MSerror(c) 
Bat c; ВС HG. IMSerror w) + € = 1)Мбішог(кЙ/Р 
Cat b; BC; — ВС) [MSerror (o) + (4 = 1)MSerror (6e)1/9 
A at beg ABC, — АВС MS, сеп 


= 


tests should be made at numerically high levels of significance (that is, 
X = .20or% = .30). This procedure does not drop a term from the model 
unless the data clearly indicate that such terms can be dropped. Since terms 
are dropped when tests on the model do not reject the hypothesis being 
tested, high power is required. un 

Bartlett’s test for homogeneity of variance may be used to indicate 
Whether or not interactions with subjects can be pooled. (Pooling is 
equivalent to dropping terms from the model.) A rough but conservative 
test for pooling of this kind is to take the ratio of the largest to the smallest 
of the mean squares due to interactions with subjects. Use as a critical 
value for this ratio F'g(df,,df,), where the df, and df, аге the respective 
degrees of freedom for the largest and smallest of the respective mean 
Squares. For this procedure, the level of significance is larger than @ = 20, 
This test is biased in the direction of rejecting the hypothesis of homo- 
geneity (i.e., does not lead to pooling) more often than an «-level test. 
The critical value errs on the side of being too small. If the hypothesis 
Я homogeneity is rejected by the rough procedure, Bartlett’s test should 

е used. 

Tests on simple main effects have denominators of the form shown in 
Table 7.3-2. With the exception of simple effects at level a;, these denomi- 
nators are actually the pooled error variance for the over-all main effect 
and the corresponding interaction. For example, the variation due to the 
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simple main effect of factor A at level b; is a mixture of the variation due to 
the main effect of factor A and the variation due to the AB interaction. It 
is readily shown that 


SSerror(a) + SSerror() MSerror(a) + (4 — 1)MSerror() 
p(n — 1) + p(n — 1)(4 — 1) 4 
In practice, it is simpler to add the sum of squares and divide by the pooled 
degrees of freedom than it is to take a weighted mean of the mean squares. 


Tests on the difference between simple main effects take the following 
form: 


я АВ) — АВ); 
V2[MSerrona + (4 =) Маго лға 
AC AGH 


t 


V2[MSenor@) + (7 — )MSeror@]/nrq | 


An approximate critical value for a ¢ statistic of this kind is obtained as 
follows: Let t, and t, be the critical values for a test of level of si gnificance 
equal to « for the degrees of freedom corresponding to MS, and 
MS, в, respectively. Then an approximate critical value for the t 
statistic is 
taMSerror(a) + t(q — 1)MSerror(s) 

Моа) + (9 — 1)MSerrore) — 


This critical value is suggested by Cochran and Cox (1957, р. 299). Іп cases 
in which the degrees of freedom for the mean Squares are both large (say, 
over 30), the critical value may be obtained directly from tables of the 
normal distribution. 

Computational Procedures. With the exception of the breakdown of 
the within-cell variation, computational procedures are identical with those 
ofap x q x r factorial experiment having п observations per cell. These 
procedures will be illustrated by the data in Table 7.3-3. 


leritica] = 


Table 7.3-3 Basic Data for Numerical Example 


Periods: b b, b. 
Subjects Е E Я Total 
Dials: сі c, е C Foret о үста. reg 
1 45 53 60140 52 57 | 28 37 46| 418 
а 2 35 41 50| 30 37 47 | 25 32 41 | 338 
3 60 65 75 | 58 54 70 | 40 47 50| 519 
4 50 48 61/25 34 51] 16 23 35| 343 
а 5 42 45 55 |30 37 43 | 2 27 37| 338 
6 56 60 77 |40 39 57/31 29 46| 435 
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Table 7.3-4 Summary Tables for Numerical Example 
Ем... Ды SS eee 


ABC summary table 


b, bs bs 
сі Cy ез сі Co ез сі Cy св 
a, | 140 159 185 | 128 143 174 93 16 137 
ы. | 148 153 19 95 110 151 69° “719777118 
288 312 278 | 223 253 325 | 162 195 255 
AB summary table AC summary table 
by bz 8 сі с C3 
(i) а, | 484 445 346 1275 a, | 361 418 496 1275 
a, | 494 356 266 1116 аз | 312 342- 462 1116 
978 801 612 2391 673 760 958 2391 
BC summary table 
сү с с 
by 9867, 212378.” 1918 
b, | 223 253 35 801 
bs 162. 195 255 612 
673 760 958 2391 
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Peis) lin SSS 
В x subj w. G summary table 


B x subj w. б, summary table 


Subject | b; by bz Subject | bı bz 8 
1 158 149 111 418 4 159 110 74 343 
2 126 114 98 338 5 142 110 86 338 
3 200 182 137 519 6 193 136 106 435 
484 445 346 1275 494 356 266 1116 


Gi) C x subj w. G, summary table 


C x subj w. Сз summary table 


Subject| су cy Ca Subject | а е е 
1 13 142 163 418 4 91 105 147 343 
2 90 110 138 338 9, 94 109 135 338 
3 158 166 195 519 6 127 128 180 435 
361 418 496 1275 312 342 462 1116 


ТЫЛУЫ Sa ЕР 


326 MULTIFACTOR EXPERIMENTS HAVING REPEATED MEASURES 


Suppose that the levels of factor A represent the noise background under 
which subjects monitor three dials. Тһе latter define factor C. Subjects 
are required to make adjustments on the respective dials whenever needles 
swing outside a specified range. Accuracy scores are obtained for each 
dial during three consecutive 10-min time periods (factor B). The basic 
data for this experiment are given in Table 7.3-3. Subjects 1, 2, and 3 make 
up G,; subjects 4, 5, and 6 make ир Gə. To illustrate the meaning of the 
data, during the first 10-min interval (5,) subject 1 had scores of 45, 53, 
and 60 on dials су, сә, and сз, respectively. 

Summary tables prepared from these basic data are given in Table 7.3-4. 
In part i are summary tables that would be obtained for any 2 x 3 x 3 
factorial experiment having n observations per cell. Part ii is unique to 
a factorial experiment having repeated measures on factors Band C. In 
the B. x subjects within G, summary table a cell entry will be denoted by 
the symbol ВР. For example, BP,, = 158, and BP, = 111. Similarly 
an entry in a cell of the C x subjects within group summary table will be 
denoted by the symbol CP,,,. 

Convenient computational symbols are defined and computed in Table 
7.3-5. Symbols (1) through (9) are identical to those used in anypxqxr 
factorial experiment in which there are n observations in each cell. Symbols 
(10) through (12) are unique to a factorial experiment in which there are 
repeated measures on factors B and C. By using these symbols, computa- 
tional formulas take the form given in Table 7.3-6. In terms of means, 


55а x subj w. groups = FED XBP,,() — АВ, — Pno + Ay. 
ijk 


The formula for this source of variation given in Table 7,3-6 leads to simpler 
computations. This source of variation is also designated SS orrore): 

There is an over-all computational check that can be made on the sum of 
the error terms. The following relationship exists: 


SS errora) = (10) — (3) 

SS error = (11) — (6) — (10) + (3) 

SS «nor = (12) — (7) — (10) + (3) 

SS errore) = (2) — (9) — (11) — (12) + (6) + (7) 
) 


+ (10) — (3 


The computational symbols on the right may be treated as algebraic 
symbols; that is, (3) + (3) = 2(3), (3) - (3) =0. The algebraic sum 
of the symbols on the right is (2) — (9). The latter is the computa- 
tional formula for 55 uElie computational symbols at the bottom of 
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Table 7.3-5 are used to partition the error variation for each of the error 
terms into parts which may be checked for homogeneity by means of an Fmax 
test. 

The analysis of variance is summarized in Table 7.3-6. Ifa model which 
includes interactions with subjects is appropriate for this experiment, the 
expected values for the mean squares are those shown in Table 7.3-1 and the 


Table 7.3-5 Definitions and Numerical Values of Computational 


Symbols 
(1) = G?|npgr = (2391)?/3(2)(3)(3) = 105,868.17 
(2) = =x? = 452 + 532 + 602 + +++ + 318 + 29° + 46? = 115,793 
(3) = (2A3)/ngr = (1275? + 11162)/3(3)(3) = 106,336.33 
(4) = (2B?)/npr = (978° + 801? + 6122)/3(2)(3) = 109,590.50 
(5) = (2C2)/mpq = (673° + 760° + 958%)/3(2)(3) = 108,238.50 
(6) = [X(AB;]nr = (484? + 445? + - - + 266°)/3(3) = 110,391.67 
(7) = [X(AC4)!]/ng = (361? + 418° + -- + + 4622/33) = 108,757.00 
(8) = [X(BC;)?]/np = (288° + 3122 + ··· + 2552)/3(6) = 111,971.50 
(9) = [Z(ABC;;,)2I/n = (140? + 159° + --- + 1185/3 = 112,834.33 
(10) = (ЕР )/дг = (418° + 338? + --- + 4352)/3(3) = 108,827.44 
(11) -[X(BP;,Y]]r = (158° + 149° + --- + 1062/3 = 113,117.67 
(12) -[X(CP,,)]/q = (113? + 1422 + :- - + 1802/3 = 111,353.67 
ЧИННЕ А САН 7 0 1-22 m 
(10а) = (zra) Jar = (418? + 338° + 5195/33) = 62,036.56 
i 
(1043) = (ЕР) Jar = (3432 + 338° + 435%)/3(3) = 46,790.89 
a 108,827.45 
1 = 25 2 B... + 1373/3 = 63,285.00 
(Па) (Е(вғ),)| | (1582 + 1492 +--+ + 1375) 
(Па) = У(аРу,) |» = (1592 + 110? + ++ + 106%)/3 = 49,832.67 
З 113,117.67 


== eee ee Е 


structure of F ratios is determined by these expected values. However, 
should the existence of interactions with subjects be open to question, pre- 
liminary tests on the model are appropriate. In this case such interactions 
have relatively few degrees of freedom. A check on the homogeneity of 
such interactions is carried out in Table 7.3-7, by use of Bartlett’s test. 
Since the observed chi square (4.77) exceeds the critical value (3.22) for a test 
with « = .20, the test indicates that the interactions should not be pooled. 
Equivalently, the test indicates that interactions with subjects should not be 
dropped from the model. 
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Thus tests on В and AB use Мб as a denominator for F ratios; 
tests on C and AC use MSerore) as a denominator; tests on BC and ABC 
use MS,,,,,, 45 a denominator. The main effect of А is tested with 
MS,,,,,. By using о = .05 for all tests, the main effects for factors В 
(periods) and C (dials) are found to be statistically significant. Inspection 
of the totals for levels у, b», and b; indicates decreasing accuracy scores for 


Table 7.3-7 Test for Homogeneity of Interactions with Subjects 


55 | MS | df log MS аг 
234.89 29.36 8 1.468 125 
105.56 13.20 8 1.121 125 
127.11 7.94 16 900 062 
=SS = 467.56 ZS df = 32 E(1/df) = .312 


_ ee а М гы ее снег 


MSpootea = (2 SS)/E df = 467.56/32 = 14.61 


A = X (dt); log MS,] = 8(1.468) + 8(1.121) + 16(.900) = 35.112 
В = (© df) log MSyootea = 32(1-165) = 37.280 
Lus — (® 40] = 1 + 0312 — .031] = 1.047 

ст gy pala - 00240] =1 + И ] 


_ 2.303(B — A) _ 2.303(37.280 — 35.112) 
m C + 1.047 


#2) = 3.22 
es a err n _ 


pa = 4.77 


the consecutive time periods. Inspection of the totals for the dials indicates 
that dial c, is monitored with the greatest accuracy and dial c, monitored 
with the least accuracy. 

The AB interaction is also noted to be statistically significant. The pro- 
files of means corresponding to the cell totals in the 4B summary table are 
plotted in Fig. 7.1-3. The profiles indicate a difference in the rate of decline 
in the accuracy scores in the three periods, the group working under noise 
level a, showing a slower decline rate than the group working under аҙ. 
Differences between corresponding points on these profiles have the form 


AB,; — AB. 


The standard error of the difference between these two means is estimated by 


2(SSsupi w. groups Æ 55в x subj w. groups) Ес: К н s 234.89) 
nr[p(n — 1) + р(п — 009 — D] 9(12) 
= 45048 = 7.10. 
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By way of contrast, the standard error of the difference between two means 
of the following form, 


АВ, X АВ, 
is estimated by 


V2MS „пе = v/2(29.36)/9 = 2.55. 
The latter standard error is considerably smaller than that computed in the 
last paragraph. А difference of the form A By; — АВ, isin part confounded 


with between-group effects, whereas a difference of the form АВ, — АВ» 
is entirely a within-subject effect. 


55 
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Figure 7.1 Profiles of means. 


Illustrative Applications. Many examples of this basic plan will be 
found in the recent experimental literature. A study by French (1959) 
illustrates one application of this plan. The purpose of this study was to 
investigate the effect of lesions in two different areas of the cortex (factor A) 
upon a bar-pressing task which set up two conditions of illumination (factor 
B). Each of the subjects was given three blocks of trials (factor C) under 
each illumination condition. There were four subjects per group. The 
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criterion measure was the average length of time the bar was depressed 
during each block of trials. To obtain homogeneity of error variance, the 
scale of measurement was transformed into log (X + 1). The addition of 
unity to each of the means avoids the occurrence of the logarithm of zero. 
The plan for the experiment may be represented as follows: 


| Illumination: by bz 
Lesions 
Trials: сі Cy су | Cy с; Cy 
a б, бі 6,| бу б, бі n=4 
а б, Gz б, | б, б, Gz 


The analysis of variance reported by French had (һе form given in Table 
7.3-8. 

In this analysis all interactions with subjects were pooled. The resulting 
pooled interaction term was the denominator for all within-subject effects. 
The significant main effect due to factor A indicated that the groups differed 


Table 7.3-8 Analysis of Variance for French Data (1959) 
eS иаа 


Source df MS Е. 
Between subjects | 7 
_ A (lesions) 1 846 6.89* 
Subj w. groups 6 122 
Within subjects 40 
B (illumination) 1 4161 1.46 
С (trials) 2 945 8.61% 
АВ 1 .008 
AC 2 389 3.54* 
BC 2 113 1.03 
ABC 2 .078 
Pooled interactions 
with subjects 30 .110 


а a PRSE pete Ue 1721.72 жый ee 
in their mean over-all criterion scores (on the transformed scale of measure- 
ment) The significant main effect due to trials indicated that the means 
changed during the trials. The significant lesion x trial (AC) interaction 
indicated that the rate of change during the trials differed for the two groups. 

Briggs, Fitts, and Bahrick (1958) have reported a series of experiments in 
which the experimental plan under discussion was used. In one of these 
studies, the effects of methods of training (factor A) upon transfer of training 
ina complex tracking task were studied. Performance under the last two 
blocks of trials in the training session and the first two blocks of the transfer 
session was analyzed. The plan of the experiment may be represented 
schematically as follows: 
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Method Training Transfer 
of 

training Trial c, | Trial с; | Trial c | Trial c; 
a, б, G; G, G, 
а Ga G G5 " 
аз Gs б; Gz б; 


The last two blocks of trials under training and the first two trials under 
transfer define the two levels of factor B. There were 17 subjects in each 
group. This plan may be considered as a special case of a4 x 2 x 2 fac- 
torial experiment with repeated measures on the last two factors. The 
criterion measure was average time on target. The analysis of variance 
reported by these workers had the form given in Table 7.3-9. 


Table 7.3-9 Analysis of Variance for Briggs, Fitts, and 
Bahrick Data (1958) 


eee АА 


Source df MS F. 
Between subjects 67 
A (methods of training) 3 2,319.02 3:728 
Subj w. groups 64 623.72 
Within subjects 204 
B (training-transfer) 1 11,492.52 113.11* 
AB 3 707.44 6.96* 
B x subj w. groups 64 101.61 
C (blocks of trials) 1 5.95 
AC 3 9.20 
C x subj w. groups 64 90.45 
BC 1 111.90 2.06 
ABC 3 24.84 
BC x subj w. groups 64 54.28 


In the actual experiment, G, worked under identical conditions during the 
training and transfer trial. Each of the other groups had different con- 
ditions under the learning trials but a common condition under the transfer 
trials. The following means were obtained from the experimental data. 


by b; 
аб о, 
a | 236 2.54 
кедш 25 
а | 1.80 2.36 
аз | 145 201 
a | 1.54 2.31 
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Because of the significant interaction, key simple main effects were tested. 
Means within brackets do not differ significantly, while those means not 
bracketed do differ at the .05 level. It is noted that all groups were equally 
good, on the average, on the transfer block of trials. The significant AB 
interaction was due largely to the fact that the control group (a,) showed 
no statistically different increase from training trials (b,) to transfer trials 
(b), whereas all the other groups did show a statistically significant 
increase. 

Another study reported by Briggs, Fitts, and Bahrick (1958) may be con- 
sidered as a special case of a 4 x 3 x 3 factorial experiment with repeated 
measures on the last two factors. In this study, factor A defines the time at 
which transfer was made from a common training condition to a common 
transfer condition. Group 4 made the transfer at time zero; i.e., group 4 
worked under the transfer condition during all sessions. A schematic 
representation of the plan is given below. 


Transfer | Sessions: b, by bs 
pme | Trials: с Ca cg Ch) c а 4 “Ca "Cr % C» 4 
ау | б, бу б, б, | б) б, б, 6) | 6 б) бі 6, 
а G, Gz б, G,| б, бұ б, б | Go б, бұ Ge 
аз б, б, б, бу | бу бз Gs бу | бу бу Cs Gs 
% б, б, б, б, | б, б, Gy 6; | б, б, 6, Gi 
—— d c = айы Еа БЕБЕ ЕЕЕ: 


п, = 14, п; = 12, пу = 14, n, = 13 


Because of unequal group sizes, the equivalent of an unweighted-means 
analysis was made on the data. In this analysis, group means replace each 
of the Хз and the data are considered as if n were unity in the calculation of 
the parts of the between-cell variation. Sums of squares computed in this 
manner are then multiplied by the harmonic mean of the group sizes. 

The subject within groups variation is given by 


2 
ОР xA. 
qr n; 


SSsupj w. groups — 


The B x subject within groups interaction is most easily obtained by com- 
puting the separate terms B x subject within б, and then pooling the parts. 
Other interactions with subjects are computed in a similar way. The 
analysis of variance reported by these workers had the form given in Table 
7.3-10. Significant interactions led to a series of tests on simple effects as 
well as individual comparisons between pairs of means. Graphs of profiles 
Corresponding to interactions were used effectively in the interpretations. 


334 MULTIFACTOR EXPERIMENTS HAVING REPEATED MEASURES 


A study by Casteneda and Lipsett (1959) provides another illustrative 
example of the same basic experimental plan. This study was designed to 
evaluate the effect of level of stress (factor A) upon learning patterns of 
varying complexity (factor B). Four block of trials (factor C) were given 


Table 7.3-10 Analysis of Variance for Briggs, Fitts, and 


Bahrick Data (1958) 
Өн 20 ee i ee 
Source df MS F 
Between subjects 32 
A (transfer time) 3 45.79 
Subj w. groups 49 69.76 
Within subjects 583 
B (sessions) 2 426.51 17.28* 
AB 6 266.12 40.78* 
B х subj w. groups 98 24.68 
C (trials) 3 144.11 46.04* 
AC 9 2.34 
C x subj w. groups 147 3.13 
BC 6 16.45 5.24* 
ABC 18 10.61 3.38* 
BC x subj w. groups 294 3.14 


I—————MÀ м n 


under each level of complexity. The criterion measure was the number of 
PLA responses in each block of trials. Тһе plan may be represented as 
ollows: 


Level | Pattern complexity: by by 
of = 

stress Trials: с, Cy C3 с сі с; C3 EI 
а CH С СА 611027262526, o Ga 
а СЕС "02770276; с, 


There were 54 subjects in each group. The order of presentation of the 
Leni was randomized, but each pattern was presented an equal number 
of times. 

The analysis of variance reported by these workers had the form given in 
Table 7.3-11. It will be noted that relative to the within-subject error 
terms the between-subject error term is exceptionally large (MS, w. groups 
— 804.66). This observation and remarks made by the workers about 
the distributions suggest that the analysis of variance might more appro- 
priately have been made in terms of a different scale of measurement, that 
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is, log X rather than X. In the analysis as it stands, the stress x pattern 
interaction (AB) was found to be statistically significant. In terms of the 
experimental variables, this finding indicated that the stress condition 
interfered more in the learning of the complex pattern than it did in the 
learning of the simple pattern. This essentially was the hypothesis that 
the workers were interested in testing. Because of the extremely high 


Table 7.3-11 Analysis of Variance for Casteneda and Lipsett 


Data (1959) 
Source df MS F 
Between subjects 107 
A (stress) 1 0.78 
Subj w. groups 106 804.66 
Within subjects 756 
В (patterns) 1 1820.04 255.26* 
АВ 1 148.34 20.80* 
B x subj w. groups 106 7.13 
С (trials) 3 167.13 80.40* 
AC 3 4.01 2.02 
C x subj w. groups 318 1.98 
BC 3 7.38 2.64 
ABC 3 6.48 2.32 
BC x subj w. groups 318 2.79 


en "ЕКОО Ул Л — ара 


between-subject error term simple main effects of the stress conditions 
could not be demonstrated to be statistically significant. 

General Expected Values for the Mean Squares. The expected values 
for the mean squares given in Table 7.3-1 are for the special case in which 
А, B, and С are fixed factors. The general case is given in Table 7.3-12. 
In this table, 


Thus D, is either 0 or 1 depending upon whether factor 4 is fixed or 
random, respectively. А - 

If D, = 1, D, = 0, and D, = 0, then the special case given in Table 
7.3-13 is obtained. There are several assumptions about the form of 
variance-covariance matrices that must be met in order that these expected 
values provide reasonable guides for setting up F ratios. A series of pre- 
liminary tests on the model may be made, in the absence of background 
information, in order to check the desirability of pooling various inter- 
actions with random factors. 
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Table 7.3-13 Expected Values of Mean Squares for Case in Which 
Factor A Is Random and Factors B and C Are Fixed 


Source E(MS) 
Between subjects 
A o? + qro? + ngroz 
Subj w. groups o? + qro? 
Within subjects 
m о2 + гоў, + nroz, + проз 
AB оў + год, + то 
B x subj w. groups оў + re$. 
c оё + 93, + ngozy + прфо) 
AC вр + 407, + 4%, 
C x subj w. groups o + qoi, 
BC о? + 03, + NOz,,, + проф, 
ABC о? + of, + поё; 
BC x subj w. groups 02 + Of, 


7.4 Three-factor Experiment with Repeated Measures (Case 110) 


In the last section the case іп which there were repeated measures оп two 
of the three factors was considered. In this section the case in which there 
are repeated measures on only one of the three factors will be considered. 
This case may be represented schematically as follows: 


сү Ca oui" €, 
ы Gu Gy PC Gy 
а : 
ь, б, Gu a Gi, 
bi Gy Gy ЖЕ Gip 
а, ; 
b, Goo Gor, ae Сы 


Each of the groups is observed under all levels of factor С, but each group is 
assigned to only one combination of factors A and B. The notation б), 
denotes the group of subjects assigned to treatment combination ab;. A 
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subject within group б); is identified by the subscript m(ij). This notation 
indicates that the subject effect is nested under both factors А and В. 

The structural model on which the analysis which follows is based has the 
following form: 


E(X itm) = И + В; + «б, + TmGi) 
F Ук + уа + Вуљ + «бурк F Уту). 

Since the subject factor is nested under both factors A and B, there can be 
no interaction between these latter factors and the subject factor. This 
model has implicit in it homogeneity assumptions on variance-covariance 
matrices associated with the repeated measures. The analysis of variance 
for this plan takes the form given in Table 7.4-1. The expected values in 


this table are for the special case in which A, B, and C are considered fixed 
factors. 


Table 7.4-1 Summary of Analysis of Variance 
EEE ee ee eee 


Source of variation df E(MS)t 
Between subjects прд — 1 Е 
А р-і og + ro? + паго 
B q—1 og + го? + пргоў 
AB (р = 1)(9 — 1) of + ғо? + nro®, 
Subj w. groups 
[error (between)] pan — 1) o? + ro? 
Within subjects npq(r — 1) 
с r-1 08 + оз, + npgo? 
AC (p = 1r — 1) of + 02, + пдо?, 
BC (9 = DY —1) оў + of, + прод, 
ABC (p — D(q — Dr — 1) 92 + 0$, + поё, 
C x subj w. groups 
[error (within)] pan — Yr — 1) Gy os. 


T Assumes A, В, and C fixed factors. 


An alternative partition of the total variation permits a comparison 
between this plan and a P X q X r factorial experiment in which there are 
no repeated measures, but n observations per cell. 


—————— — е ей НИИ 


Source df 
Total npqr — 1 
Between cells pgr — 1 
Within cells pqr(n — 1) 
Subj w. groups рап — 1) 
C х subj м. groups рдп — 1)(г — 1) 


— нн Ушу 
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The main effects and all interactions of factors A, B, and C are part of the 
between-cell variation whether or not there are repeated measures. Тһе 
partition of the between-cell variation is identical in the two cases. When 
there are repeated measures on factor C, the within-cell variation is sub- 
divided into two parts. One of these parts is 


SSsubj w. groups = rXXX(P,G5 M Ав,» 
T" EXX(P FA ber XX(AB; 
i m nnde 
(The symbol XEXP?,; represents the sum of the squared totals from each 
subject. Each totalis based upon r observations.) This source of varia- 
tion is a measure of the extent to which the mean of a subject differs from the 


mean of the group in which the subject is located. The other part of the 
within-cell variation is 


SSo x subj w. groups == SSw. cell = SSsubj w. groups; 


where SSw: cet = (Xena LAB GI 
X(ABC?, 
ду we Хавс ; 


Because of the structure of the F ratio for this plan (when А, B, and C are 
fixed factors), the following notation is sometimes used: 


SSsubj w. groups = SSerror(between), 


SSc x subj w. groups == SSerror(within)- 
Each of these error terms may be subdivided and tested for homogeneity by 


means of F,,,, tests. The first term may be subdivided into the following 
parts: 
Hui d 
Subj w. Gu uc 
Subj w. Giz nc 
Subj w. Gq (ld 


There will be pg terms, each having the general form 
SSsunj w.G, = ДЕРТІ) ігі Ав,» 


The critical value for an F,,, test would be Аа-а) (pq; n — 1). The 
second error term may be divided into the following parts: 


Source df 
C x subj w. Gu (n = Yr — 1) 
C x subj w. Сүз (n — D(r — 1) 


C x subj w. Са (n — Qr = 1) 
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There will be рф terms; each has the general form 


550 x subj w. ба XXn PG) АВС, + АВ,). 


mk 


The critical value for an F,,,, test in this case would be 


TE [pq, (n = Dr = 1)]. 


Should either of these two error terms prove to be heterogeneous 
in terms of the criterion scale of measurement being used, the experimenter 
should consider a transformation on the scale of measurement in terms of 
which the analysis of variance may be carried out. 


Table 7.4-2 Denominator of F Ratio for Simple Effects 
a a ee 


Simple effect Denominator of F ratio 


Aatb; AB, — АВ, 


= ржы MS, " 
B at a; АВ, - АВ, 'error(between) 


Cana „Аб MG 
Cath; ВС, — ВС, MSerror(within) 
Cat abi; АВС, — АВС, 


Aatc, АС,- АС 
Bate ВС, — ВС [MSerror(between) + (ғ = 1)MSerror(within)]/” 
AB at cy АВС,,, — АВС,,, 
fa EE ee 
In making tests on simple main effects, denominators appropriate for F 
ratios are indicated in Table 7.4-2. It should be noted that a difference 
between two simple main effects is a mixture of main effects and interaction 
effects. In cases where the main effects and interaction effects have dif- 
ferent error terms, a compromise error term is constructed. The latter is a 
weighted average of the different error terms, the weights being the respective 
degrees of freedom. Because of this pooling of heterogeneous sources of 
variation, the resulting F tests are potentially subject to bias. 
In this design, when the assumptions of homogeneity of covariances are 


questionable, critical values of the conservative tests involving factor C 
have the form 


Fill, рап — 1) insteadof А [(r— 1), pq(n — 1)(r — 1)], 
Р, — 1), pa(n — 1) 
instead of — F,_,[(p 1)(r — 1), ра(п — 1)(r — 1)]. 


That is, the degrees of freedom for numerator and denominator are each 
divided by r — 1. 
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Computational Procedures. А numerical example will be used to 
illustrate the computational procedures. [This example is a modified ver- 
sion of an experiment actually conducted by Meyer and Noble (1958). 
Suppose that an experimenter is interested in evaluating the effect of anxiety 
(factor A) and muscular tension (factor B) ona learning task. Subjects who 
score extremely low on a scale measuring manifest anxiety are assigned to 
level a,; subjects who score extremely high are assigned to level а». The 


Table 7.4-3 Basic Data for Numerical Example 
— À "| a — — A —— —— 


Blocks of trials 
Subjects Total 
c са єз “4 
! 
1 18 14 12 6 50 
bı 2 19 12 8 4 43 
3 14 10 6 2 32 
а 

4 16 12 10 4 42 
bs 5 12 8 6 2 28 
6 18 10 5 1 34 
7 16 10 8 4 38 
b 8 18 8 4 1 31 
9 16 12 6 2 36 

а | 
10 19 16 10 8 53 
b, 11 16 14 10 9 49 
Е 12 16 12 8 8 44 

~—_| |) за Ел SSS 


tension factor is defined by pressure exerted on а dynamometer. One half 
of the subjects at level a, are assigned at random to tension condition bi; 
the other half are assigned to level ba. The subjects at level a, are divided in 
a similar manner. Subjects are given four blocks of trials (factor С). The 
criterion is the number of errors in each block of trials. Suppose that the 
observed data are those given in Table 7.4-3. И 

In this table subjects 1, 2, and 3 form group Суу; subjects 4, 5, and 6 form 
group б; etc. Subject 6 is represented symbolically as Psa), that is, the 
third subject in Gy. This plan may be classified as a 2 x 2 x 4 factorial 
experiment with repeated measures on the last factor, п = 3. Summary 
data obtained from the basic observations are given in Table 7.4-4. АП 
these summary tables are identical to those that would be obtained for a 
2 x 2 x 4 factorial experiment having no repeated measures. Computa- 
tional symbols are defined and evaluated in Table 7.4-5. Symbol (10) is 
the only one unique to a repeated-measures design. The data for the latter 
symbol are obtained from the Total column in Table 7.4-3. 

The analysis of variance is summarized in Table 7.4-6. Suppose that the 
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Table 7.4-4 Summary Table for Numerical Example 


ABC summary table 


| сі сз C3 C4 Total 
by 51 36 26 12 125 
€x | "B, | 46 730 721 7 104 
by 50 30 18 y, 105 
баа. n UR Er MT. 146 
198 138 93 51 480 
AB summary table AC summary table 
b, by Total а Cy C3 e Total 
a | 125 104 229 ау 97 66 47 19 229 
a | 105 146 251 а | 101 72 46 32 251 
LR MAMMA лайы. 2 
230 250 480 198 138 93 51 480 
BC summary table 
сі Cy ез с Total 
by 101 66 44 19 230 
by 97 рғ 490 32 250 
198. 138 93% SÍ 480 


I = — ÁN O 


Table 7.4-5 Definitions and Numerical Values of Computational 


Symbols 
(1) = G?/npqr = (480)2/48 = 4800.00 
Q)-xx? 18? + 142 +... 4 g2 4 ge = 6058 
(3) = (24D /ngr = (229? + 251224 = 4810.08 
(4) =(2B})/mpr — = (2302 + 2502)/24 = 4808.33 
(5) -(ZCp/nq = (198° + 1382 + 932 + 512)/12 = 5791.50 
(6 = [X(AB?)l[nr = (125° + 1042 + 105° + 1462)/12 - 4898.50 


(7) = PXAC$JI/ng = (97° + 662 


(8) = [XGBCT)]|np = (1012 + 662 
(9) = 
(10) = 


(ЕР) = (50° + 432 


+ ... 
ae imn = (512 + 36° +... 


46? -+ 322)/6 = 5810.00 
+ 492 + 322)/6 = 5812.00 
+ 282 + 252)/3 = 5923.33 
+ 492 + 442)4 = 4981.00 


WYM) 10419 
LUT vc LETS (9) + (00) — (6) — @ le aa zi 2 
961 Scv € Laat (D — (9 + @) + ©+@)-W- (9)— © 28V 
181 90% € LECT (0+ (9— Ф – (9) эя 
6СІ 182 € [^25] (D (9 — (9— (D OV 
xO£ CST 0$`0ЕЄ E 0S'166 (D — © (sem) 2 
9€ 001101 (00 — © syoofqns uyum 
4 " 2 [(uo9419q) 10119] 
ІСОІ 8 0628 (9) - (00) sdnoi3 "A fans 
«LLL 60708 I 60708 (D +) — (©) — (9) gv 
£€8 I EES (1) = Ф) | (иопәз) g 
80°01 i3 8001 (0 = (0) _ (hopue) V 
П 00181 (D = (00) sqoofqns иәәмјә9 
Ч SW JP SS түпшло} [euonejnduro?) 


чоцуюел JO INOS 


әзививд JO sisA[euy Jo Алешшп$ 9-р'/ AJEL 


en 
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05 level of significance is used in all tests. The main effect for factor C 
(trials) is found to be Statistically significant. This indicates that the average 
number of errors differed in the four blocks of trials. Inspection of the 
totals for the blocks indicates a decreasing number of errors from сі to c4 
The anxiety x tension interaction is also statistically significant. This 
indicates that the pattern of the number of errors in the two anxiety groups 
depends upon the level of muscular 
tension. The profiles corresponding 
Bigannenvo to this interaction effect are shown in 
/ Fig. 7.2. These profiles indicate that 
10 ту; the effect of muscular tension upon 

i 5 > 
7 number of errors differs—high- and 
low-anxiety-level groups perform in 
different ways. A test on the differ- 
—] ence between mean number of errors 
between the two anxiety levels under 
the no-tension condition (b,) is given 


b 
ET PUDE AR 
2nrMSerror(between) 


DE. 4 2 
Low anxiety = (125 — 105% 105) == 1.62. 
100 | 2(12)(10.31) 


b Ong . E 
1 2 
No tension High tension һе Critical value for this test is 


Figure 7.2 Ео(1,8) = 5.32. 


ooh Thus the data indicate no statistically 
Significant difference between the high- and low-anxiety groups under the 
condition of no tension. A test on the difference between the high- and 
low-anxiety groups under the high-tension condition is given by 


в AB ~ ABS* (104 — 146) 
2nrMSerror(between) 2(12)(10.31) 


150 


m 
РЧ 
© 


Total errors 


m 
m 
o 


110 


= 7.13. 


Apart from the homogeneity assumptions required for the validity of the 
usual Е tests, the Variance-covariance matrices provide information which is 
of use in its own right in describing processes operating in the experiment. 
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The variance-covariance matrices associated with the data in Table 7.4-3 are 
given below: 


Level aby, Level ab;s 
ĉi с E “4 Cj Ca е “4 
сі | 7.00 4.00 5.00 4.00 c, | 933 4.00 0.00  —.66 
в 4.00 6.00 4.00 Co 4.00 4.00 2.00 
сз 933 6.00 сз 7.00 4.00 
Cy 4.00 Cy T 2.34 
Level aby, Level abs; 
e са Cg C4 e Co C3 C4 
| 
€; | 133 —200 -2.00 —134 сі | 300 3:00 100 —.50 
С 4.00 2.00 1.00 с 4.00 2.00 .00 
сз 4.00 3.00 C3 1.33 593 
[^ 2.34 [^ „33. 


The уагїапсе-соуагїапсе matrix obtained by averaging corresponding 
entries in each of the above matrices is given below: 


Pooled variances and covariances 
сі с C3 с 


сі 516 225 1.00 .38 


© 400 3.50 1.75 
сз 5.42 3.33 
C4 2.25 


In this experiment, the levels of factor C represent successive blocks of 
trials in a learning experiment. Typically in this kind of experiment the 
variances tend to decrease as the learning increases. In the pooled variance- 
covariance matrix this trend is not clearly shown (5.16, 4.00, 5.42, 2.25). 
Further, the covariances between neighboring blocks of trials tend to be 
relatively higher than covariances between blocks which are farther apart. 
This trend is clearly shown in the pooled variance-covariance matrix. The 
symmetry conditions on the pooled variance-covariance matrix required for 
the strict validity of the usual F test do not, in general, hold for learning 
experiments. (Use of Hotelling's T° statistic would, however, provide an 
exact test even if the symmetry conditions do not hold.) Р 

From the pooled covariance matrix, the average of the entries along the 
main diagonal defines var, and the average of the > entries off the main diag- 
onal defines cov. In this case, var = 4.21, and cov = 2.04. In terms of 
the latter quantities, 


MSerror(between) = var + (r — 1) cov = 10.33, 
MSerror(within) = var — cov = 2.17. 
Within rounding error, these values are equal to those obtained for the 
Corresponding mean squares in Table 7.4-6. 
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Illustrative Applications. The numerical example just considered is 
actually a modified version of an experiment reported by Meyer and Noble 
(1958). The plan for their experiment was a 2 x 2 x 6 factorial experi- 
ment with repeated measures on the last factor. Factor 4 indicated anxiety 
level as measured by the Taylor Manifest Anxiety Scale. Factor Bindicated 
the level of muscular tension exerted on a hand dynamometer during the 
experiment. Factor C represents blocks of trials. The criterion was 
the number of errors per block in thecourse of learning a verbal maze. The 
purpose of the experiment was to investigate the interaction between mani- 
fest anxiety and muscular tension during a learning task. There were 20 
subjects in each of the four experimental groups. 


Table 7.4-7 Analysis of Variance for Meyer and Noble Data 


Source of variation df MS F 
Between subjects 79 
A (anxiety level) 1 2.33 
B (muscular tension) 1 .96 
AB 1 41.88 6.12* 
Subj w. groups 76 6.84 
Within subjects 400 
C (blocks of trials) 5 216.86 309.80* 
AC 5 2 
BC 5 .29 
ABC 9 .58 
C х subj w. groups 380 70 


The analysis of variance reported by Meyer and Noble is given in Table 
7.4-7. Тһе significant main effect for factor C indicated that the mean 
number of errors changed during the blocks of trials. The significant AB 
interaction indicated that the effect of muscular tension is different in the 
two anxiety groups. Inspection of the profiles corresponding to this inter- 
action indicated that the presence of muscular tension in the high-anxiety 
group tended to increase the number of errors relative to the no-tension 
condition. The effect in the low-anxiety group was just the opposite; under 


the tension condition there was a smaller number of errors relative to the 
no-tension condition, 
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Method оѓ | i 
КА ot Association Ld 
prior | 1 
presentation маме с сә 
| 
ы Gy Gy 
a by б Сіз 
8 Сіз 1з 
bi бы Gy 
а by Соз Gop 
bs Gog б» 


The levels of factor А define methods of prior presentation of the syl- 
lables. The levels of factor B define the association value of the syllables. 
However, only the syllables in list c, were included in the prior presentation. 
Syllables in list c; were matched with those in list c, for association value but 
were included in the experiment to serve as controls. There were 20 sub- 
jectsineach group. This experimental plan does not conform to the pattern 
of a2 x 3 x 2 factorial experiment, since treatment combination асу» is 
identical with treatment combination асу». 

This kind of plan should not in general be analyzed as a2 x 3 x 2 fac- 
torial experiment. No meaningful interaction between factors A and С 
exists. One possible method of analysis is to use the difference between 
thresholds on syllables in lists с, and c; as the criterion measure. In this 


case the analysis of variance takes the following form: 
me ШАШ ше кы 


Source df 

A (method) 1 
В (assoc. value) 2 
AB 2 
Within cell 114 


ences between subjects from 
analysis of variance, опе may 
he resulting data as a 3 х 2 
group. This analysis takes 


This type of analysis in part removes differ 
experimental error. Asa second phase of the 
combine the two levels of factor A and treat t 
factorial experiment having 40 subjects in each 


the following form. 
b | eee 


Source df 
Between subjects 19 
В (assoc. value) 2 
Subj w. groups 117 
Within subjects 120 
C (lists) 1 
BC 2 
117 


C x subj w. groups 


20 Quaf2)05 


I I I I I 
240 + go I I “а I I шаш 
“ои + "go + do I и “а ku “а dgn 
“odu + “зори + ko + go I и "q а а tdg 
4 оби + “Porqu + “бо + Зо I u ‘a b “а "deo 
gobdu + “otqbu + “8% + “тти + o + 30 І и “а 5 d H4 
204 + “Sot + do I I 4 I I (nu, 
Tom + фол + “отти о + Зо I u 4 ғ “а figo 
foudu + "2o quu + &оа + “оғаш + отти + “оТ + io I и 4 ^q d fg 
&o4bu + осли + ол + «отри + “оти + “tog + фо I и 7) b “4 to 


(sWwa о y f 1 pa 
< rrr E Ie Aer 


Е 


Sorenbg ULIN 20у sanjeA paysedxy үеләцәгу g-p'/ AQEL 
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General Expected Values for Mean Squares. The expected values for 
the mean squares given in Table 7.4-1 are a special case of those given in 
Table 7.4-8. Depending upon the experimental design, D,, D,, and D, are 
either zero or unity. The expected values in Table 7.4-1 are obtained from 
those in Table 7.4-8 by assuming that each of these D’s is zero, i.e., by 
assuming that factors A, B, and C are fixed. 


7.5 Other Multifactor Repeated-measure Plans 
The plans that have been considered in this chapter have the following 
general form: 


әр “ш eor М 
и G, G, б 
u, | б, Gy 25: б» nsubjects per group 
Uy G, G, tt G, 


The analysis of this general form may be outlined as follows: 


минеке ee 


Source df 

Betivecn ЦЕ дей 

U —1 

Subjects w. groups gn = 1) 
Within subjects а) 

V h-—1 

UV (g = Ith — 1) 

V x subj w. groups gn- Di — 1) 


The levels of factor U may, for example, constitute the рф treatment 
combinations in a p X q factorial set. In this case the following subdivi- 


Sions are possible: 


Source df Source df 
U get UV (g — Ih — 1) 
A p AV (р — Dh — 1) 
—1 BV (4 = 1h — 1) 


B 
AB (р Jy -1 | АВУ (p—U(q — 00 – 1) 
icta NE Me AE || үк A у 2 —— 
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Alternatively, U may define the levels of factor A, and V may define the 
treatment combinations in ag х r factorial set. 


subdivisions are possible: 


In this case the following 


—————MÓ 2 


Source df Source df 
12 h—1 UV (g — Ih — 1) 
B 4-1 АВ (р = 1)(4—1) 
(e r—1 AC (p — 1)(r — 1) 

BC (9-10 —1) | ABC (p — Ig — Ir — 1) 
——— O E згн Ц 
Source df 

V x subj w. groups gn — 1)(h — 1) 

B x subj w. groups pi - Iq — 1) 

C x subj w. groups pi — 1r — 1) 


BC x subj w. groups 


pi = D — De — 1) 


Table 7.5-1 Summary of Analysis of Variance 


Source of variation df E(MS) 

Between subjects npq —1 1 4 
А p-1 оў + ro? + nqrsoz 
B 4-1 o? + rso® + пргвоў 
АВ” (p —1)(4—1) 92 + гуа? + пгзоў з 
Subjects w. groups рдп — 1) o? + rso? 

Within subjects npq(rs — 1) 

EAR 4 ESTA 
c r-1 оў + so, + npqso; 
AC (p = Jr — 1) оў + зоё, + пдхо?, 
BC (q = 100 — 1) оў + зоў„ + прзоў, 
ABC | (p = 1)(4 — D(r — 1) оў + soz, + поз, 
C x subj у, groups рап — Ye — 1) oF sob. 
5, 5—1 оў + re$, + прфгоў 
AD (p — Js — 1) о? + rod, + пдго?, 
BD (q — 1s — 1) оў + ғой, + прғод, 
ABD ] (p —1)(4 — 1s — 1) o? + roi, + nrolas 
D х subj w, groups pain — 1X — 1) o + roi, 
CD (r — 1X — 1) 0 + obs, + прдо?, 
42 (р-1Хе-1)%-1) в + RE Lm 

(q — 1r — Ds — 1) 02 + o2,, + прод, 

ABCD е {РОйуб 


CD x subj w. groups 


(p — DG — 00 — 1G — 1) 


2 
02 + 0h, + пой 
2 


Pan — 1)(r — 1)(з5 — 1) o + [M 
————— л». 2 
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As a third case, the levels of factor U may constitute рф treatment com- 
binations in a p x 4 factorial set, and the levels of factor V may constitute 
the rs combinations in ап r x s factorial set. In this case the general form 
specializes to the analysis summarized in Table 7.5-1. In this table, terms 
have been rearranged to indicate denominators for F ratios. The expected 
values given in this table are derived under the assumption that factors 
A, B, C, and D are fixed. 

To illustrate the subdivision of the UV interaction for this case, one may 
first subdivide the U factor as follows: 


А 


Source df 

UV (g = Ih = р 

AV (p = Vh = 1) 

BV (q = Ith = 1) 

ABV (p = Ig = 00 — 1) 
aeae Á 


Then each of the interactions with factor V may be subdivided. For ex- 
ample, the AV interaction may be subdivided into the following parts: 
шас. е5 


Source df А 

AV (p = 0 = 1) 

AC (р = Vr = 1) 

Ар (p = Ys = 1) 

ACD (p = Yr = Vs = 1) 
Mild ————— 


Analogous partitions may be made for the BV and ABV interactions. 

Computational procedures for all treatment effects in the analysis sum- 
marized in Table 7.5-1 follow the usual procedures for а four-factor fac- 
torial experiment. Variation due to subjects within groups is part of the 
Within-cell variation. The latter is given by 


X(ABC Dj, 
SS. oo = BX? – ы, 


Variation due to main effects of subjects within groups is given by 
хр, XB 

rs nrs 
The pooled interaction of treatment effects with subject effects is given by 
= SSw. сеп — SSsubj w. groups- 
This pooled interaction corresponds to the V x subjects within groups 
interaction in the general form of this plan. The C x subjects within 
groups part of this pooled interaction is given by 


(СР — (АВС == SSsubj w. groups. 
s ns 


SSsupj w. groups = 


SSpooted Int w. subj 


580 x subj w. groups = 
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Controlling Sequence Effects. For the plans that have been discussed 
in this chapter, in cases where the sequence of administration of the treat- 
ments was not dictated by the nature of the experimental variables, it was 
suggested that order be randomized independently for each subject. A 
partial control of sequence effects is provided by the use of the Latin-square 
principle; this principle is discussed in Chap. 10. А variety of repeated- 
measure designs using this principle is also discussed in Chap. 10. A more 
complete control of sequence effects (but one which is more costly in terms 
of experimental effort) is available. This more complete control is achieved 
by building what may be called a sequence factor into the design. 

Consider a р x r factorial experiment in which there are to be repeated 
measures on the factor having r levels. The number of different sequences 
or arrangements of ғ levels is r! =r(r — 1)(" — 2)(r — 3):--(1). For 
example, ifr is 3, the number of possible sequences is 3! = 3-2-1 = 6; ifr 
is 5, the number of possible sequences is 5! = 5-4-3-2+1 = 120. Eachof 
the possible sequences may define a level of factor B in which q = r!. 
Thus, instead of the original p x r factorial experiment one hasa p x q X r 
factorial experiment. The analysis of the latter experiment would have 
the following form: 


Source df 
Between subjects npq -1 
A рі 
В (sequence of С) Ф) 121 
AB. У (р = 1) — 1) 
Subjects within groups p(n — 1) 
Within subjects npq(r — 1) 
7 ET 
(p = 00е = 1) 
ВС (9 — Dir — 1) 
ABC { (p — 1)(9 — 1)r — 1) 
C x subj w. groups рап — Yr — 1) 
Ei senio op. culti залаа 


This kind of sequence factor may be constructed in connection with any 
repeated-measure design in which the sequence can logically be varied. 
However, for designs in which the number of levels of the factor on which 
there are repeated measures is five or more the required number of levels of 
the sequence factor becomes prohibitively large. 

One possible method of reducing this number is to select deliberately 
representative sequences from among the total possible sequences. A dif- 
ferent approach might be to take a stratified random sample of all possible 
sequences, where the strata are constructed so as to assure a partial balance 
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with respect to the order in which each of the levels appears within sequences 
which are used in the experiment. This kind of stratification may be 
achieved by using a Latin square. 


7.6 Tests on Trends 

Consider a p x q factorial experiment in which the levels of factor B 
define steps along an underlying continuum, i.e., intensity of light, dosage, 
blocks of trials in a learning experiment. Тһе magnitude of the AB inter- 
action in this kind of experiment may be regarded as a measure of global 
differences in the patterns or shapes of the profiles for the simple main effect 
of factor B. It is often of interest to study more specific aspects of such 
differences in patterns. Toward this end it is necessary to define dimen- 
sions in terms of which relatively irregular, experimentally determined pro- 
files may be described. There are different methods whereby descriptive 
categories for this purpose may be established. In this section such cate- 
gories will be defined in terms of polynomials of varying degree. Other 
functions, such as logarithmic or exponential, rather than polynomials, 
may be more appropriate for some profiles. The latter functional 
forms are not so readily handled as polynomials. However, polynomials 
may be used as first approximations to the latter forms. 

Given the set of means АВд, В...» ABu in a p X 4 factorial experi- 
ment. The line joining these means (the profile of a simple main effect of 
factor B at level a;) may have an irregular shape. Asa first approximation 
to a quantitative description of the shape of a profile, one may obtain 
the best-fitting linear function (straight line). The slope of this best-fitting 
straight line defines the linear trend of the profile. Asa second approxi- 
mation to the pattern of the experimentally determined set of points, one 
may fit a second-degree (quadratic) function. The increase ІП goodness of 
fit Over the linear fit defines what is known as the quadratic trend of the pro- 
file. As a third approximation to the pattern, one may obtain the best- 
fitting third-degree (cubic) fi unction. The increase in goodness of fit of the 
latter function over both the linear and quadratic functions defines the cubic 
trend of the profile. | t қ 

This process of fitting polynomial functions of increasingly higher degree 
may be continued up to a polynomial of degree ~ 1, whereq is the number 
of points in the profile. A polynomial of degree q — 1 will always provide 
an exact fit to q points, since statistically there are only q — | degrees 
of freedom in this set of points. Іп most practical applications of the pro- 
Bu to be described here, the degree of the polynomial is seldom carried 

eyond 3. 1 

Global differences between shapes of profiles for simple main effects of 
factor B give rise to the AB interaction. Differences between the linear 
trends of such profiles define that part of the AB interaction which is called 
AB (linear). Thus the AB (linear) interaction represents а specific part of 
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the over-all AB interaction. Differences between quadratic trends in the 
profiles of the simple main effects define the AB (quadratic) variation. In 
general, the over-all variation due to АВ interaction may be divided into 
nonoverlapping, additive parts. These parts arise from specific kinds 
of differences in the shapes of profiles—differences in linear trends, dif- 
ferences in quadratic trends, etc. Symbolically, the AB interaction may be 
partitioned into the following parts: 


Source of variation df 

AB (p —1)4—1) 
AB (linear) pu 
AB (quadratic) p-1 
AB (degree q - 1) p—i 


а Ра eee 


These parts will sum (о the over-all 4B variation. 
The expected value of the mean Square due to differences in linear trends 
has the following form: 
E(MSasiny) = о? + пой (ил). 


А test on differences in linear trend involves the hypothesis that оё, = 0. 
This is equivalent to a test on the hypothesis that the profiles of the simple 
main effects have equal slopes, i.e., that the best-fitting linear functions are 
parallel, 


The expected value of the mean Square due to differences in quadratic 
trends has the following general form. 


E(MS as quaay) Sorts поз ашай). 


A test on differences in these trends indicates whether or not the experi- 


mena data support the hypothesis that the profiles have equal quadratic 
rends. 


obtained by using the coefficients of orthogonal polynomials associated 


eens of an independent variable. Such coefficients are given in 


The coefficients associated with the linear function will be designated 


ЛЕО ОМО, и,. 


TESTS ON TRENDS 355 


€x сз ox Ck yii с, 
by Gy Gy Gy G 
a 
b, б, Gy, Gy, Gi, 
by Gr G5 Gy б, 
ау 
b, Gra Gy, Gy, Gra 


For example, for the case in which r = 4, the respective coefficients 
are 
КИЕСІ) ЕЕ 


The coefficients associated with the quadratic function having ғ experi- 
mentally determined points will be designated 


v, 


Dis. Vay, cig MU Ше 


For the case r = 4, the respective coefficients are 
1 —1, —1, 1. 


Note that the sum of the coefficients in each case is zero, Hence these 
coefficients define a comparison or contrast among the r points. The 
coefficients associated with the cubic function will be designated 


Wy, М» 6. Wes crea, Wr 


In an experiment of this kind there are three sets of interactions that may 
be divided into parts associated with differences between trends—the AC, 
the BC, and the ABC interactions. Procedures for obtaining the variation 
due to differences in linear trends within each of these interactions will be 
outlined. Higher-order trends will be found to follow the same general 
pattern. It will also be convenient to indicate procedures for obtaining the 
variation due to linear and higher-order trends for the main effect of factor 
C. Thelatter indicate the goodness of fit of polynomials of varying degree 
to the profile corresponding to the main effect of factor C. шеп, 

The notation to be used in obtaining the linear part of the variation in 
interactions with factor C as well as the variation in the linear part of the 
Variation in the main effect of factor C is summarized in part i of Table 

6-1. 
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Table 7.6-1 Notation for Analysis of Linear Trend 


Coefficients for linear comparison: 


Шр Шу оозу Wy, анон Up 


Алау = Ул (Хал) 
АВС), = Ұш (АВС) = E Xha 


© ACi = Ош(АСа) = Уавсу 
BC} = Zu(BCy) = АВС 
GUIDE = ХХАВС, = УУУХ, у, 
= 23 сот | 
(1) = (С) Inpg(Sug) G^) = SEACH "ng Zug | 
Gi) 2) = EGG, PIG) а? = X(BCjInp(uj) | 
(5) = Хавс; ухи) 
Source Computational formula df 
Within subjects (linear) (29 npq 
itn subjects (linear) 
(iii) C (linear) a’) 1 
AC (linear) (3°) — (1) "Эрте! 
BC (linear (4) — (17) 4—1 
ABC (linear) 690-8)-4)-) (p-Dq-D | 
C x subj w. groups (linear) (2’) — (5) pq(n — 1) 


The symbol X;,;, is a weighted sum of the ғ observations on subject m(ij), 
the weights being the respective linear coefficients of the appropriate poly- 
nomial. The analysis of linear trends for this case reduces to what is 
essentially an analysis of variance of a р X q factorial experiment with n 
Observations per cell, an Observation being an Xo 

Other symbols defined in part i are also weighted sums. То illustrate, 
АВС) a weighted sum of terms appearing in a row of an ABC summary 
table. For example, the row which corresponds to level a, and level b, has 
the form 

ABC ABCg, «5 “ABC, s ABC,,,. 


Each of these totals is the sum of n observations. From these totals, 
ABC;, = u\(ABC,,,) + и(АВС) + + и (ABC). 
An equivalent expression for ABC; is 
АВС); = Ха) + Xi) bo с Ха) = УХ. 
т 


Опе expression serves as a computational check on the other, | 
Computational symbols ‘convenient for use in the computation of the 
linear sources of variation are given in part ii. In the denominator of all 
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symbols is the term uz. The other term in the denominators is the number 
of observations that goes into an element which is weighted in the weighted 
sum. Forexample, there aren observations in A BC,,,; there are np observa- 
tions in AC,,; there are ng observations in BC. Actual computational 
formulas are summarized in part їй. The mean square due to AC (linear) 
estimates an expression of the form 


>); =p) 


where the fs represent regression coefficients for linear profiles corre- 
sponding to simple main effects of factor C at each of the levels of factor 


Table 7.6-2 Notation for Analysis of Quadratic Trend 
bb ee ea ТУ So C SSS eee 


Coefficients for quadratic comparison: 
Up Us ...» Uks fe Cp 


Хан) = Урт) 
ABC; = Zvl ABCs) - Xman 


ij 


y AC! = ®дАС„) = YABG, 
BC} —Xe(BC;) = XABC; 
C = Xn, = SABC) = УУУХ 
ШЕЛ: 2 -.-.-... а ы, 
(1^) = (C’*/npq=2}) (3^) = X(AC; Y Ing Ev) 
Gi) Q^) = A(X han) E) (4°) = Х(ВС;) про) 


(5^) = ХАВС()Үһ(Х 
ew) Ет ел а Е 
Computational formula 


Source 
Within subjects (quadratic) Qhan 
pi C (quadratic) a”) д 
(iii) AC (quadratic) 6) — (1 2 
BC (quadratic) (4) - 0» р 
ABC (quadratic) (59 — 3") — (4^) + (^) 
Q^ - 6) 


C x subj w. groups (quadratic) 


A, and where f represents a pooled regression coefficient for all linear pro- 
files in the set. Ee 
Computational procedures for the quadratic trend are summarized in 
Table 7.6-2. Each of the entries in this table has a corresponding entry in 
Table 7.6-1, with v, replacing corresponding tr throughout. The symbol 
Xap is used to distinguish a weighted sum in terms of quadratic weights 
from the corresponding sum in terms of linear weights, designated 
by the symbol Ху Higher-order trends follow the same pattern, with 


mij)" 
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the appropriate coefficients serving as the weights. If, for example, ғ = 4, 
it will be found that 


SS, = SSeqin) + SSe(quaay + SSe(eunic). 
Similarly, 58а = $$ас(ип)у + SSac(quaay + SSac(eubic). 


Numerical Example. The numerical data in part i of Table 7.6-3 will 
be used to illustrate the computational procedures. These data representa 
2 x 2 x 4 factorial experiment with repeated measures on factor C, three 
observations in each group. Suppose that factor C represents equally 
spaced blocks of trials in a learning experiment. For example, subject 1 is 
assigned to treatment combination ab, and has scores of 1, 6, 5, and 75 
respectively, on a series of four blocks of trials. 

Since factor C has four levels, coefficients for the case in which there are 
four points to be fitted are appropriate. From Table B.10 the linear 
coefficients are 3 TEES 


These coefficients appear at the top of parti. From the data on subject 
1 one obtains 
Хаа» = (—3)(1) + (—1)(6) + (15) + (3)(7) = 17, 


the weighted sum of the observations on subject 1, the weights being the 
linear Coefficients. Other entries in the column headed Хм) are obtained 
inananalogous manner. For example, 


Xia» = (—3)(2) + (—1)(7) + (1)(12) + (3)(15) = 44. 

The entries in the column headed ABC; are obtained as follows: 
АВС, = Xiay + Ха + Ха) = 17 + 28 + 18: = 63. 
ABCis = Xian + Xsan + Хау = 44 + 26 + 27 = 97. 


_ The left-hand side of part ii represents an ABC summary table obtained 
in the usual manner from data on the left-hand side of parti. From the 
first row of the ABC summary table one obtains 

АВС, = (—3)(4) + (—1)020) + ( 000) + (3)(25) = 63. 


This entry provides a check on the entry ABC;, computed in part i. The 
second row of the ABC Summary table is used to obtain ABC/,. А cor- 
responding entry is available from part i. è 

The left-hand side of part iii represents an AC summary table. The 
entries in the column headed AC; are weighted sums of the entries in the 
corresponding rows. Checks on these entries may be obtained from part 


ii. Checks on entries in part iv may also be obtained from the ABC} 
column of part ii. For example, 


BC, = АВС + ABC, = 63 4-92 = 155; 
BC; = АВС + ABCh = 97 + 118 = 215. 
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Table 7.6-3 Analysis of Linear Trend—Numerical Example 


Linear coefficients: —3 — —1 1 3 Iu – 20 


Subject | c са 6&6 ea Xman АВС), 
1 1 у сой 17 
b, 2 О бб ТО 28 
3 g-ig пем Ж Bio) 18 63 
а 
4 5o 7 r12 is 44 
b; 5 1-7 Hé Mes 50 26 
s 6 3 ТОТ! 27 97 
1 
7 1 дең ЧТ 1732 38 
b, 8 1 1——74—710 30 
9 1 TOS MES 24 92 
аҙ 
10 о? 36 
ba 11 a 210) 2115 44 
12 ОККО ү ln МЗ 38 118 
20 50 90 130 370 370 
OR ОНЕ Еа 7 30 уузу ee ee 
City Comey Cah 00 АВС), AC; 
b, 1 12094920. 25 63 
а bz EET ETE: 97 160 
Gi) 
b, teo Pr nr 92 
а Ж 7 6 25 40 118 210 
20 50 90 130 370 370 
listens ПЕТ Se ба ee nc olet co ME Mal | apo SS 
с са C3 C4 AC} [04 
(ii) a 10 40 50 60 160 
а 10 10 40 7 210 370 
20 50 90 130 370 
JN ДЕН ое вва Е аайы | ш 
Ау fm BC; с 
(iv) by 7 532430055 255 155 
bs 13} 26, 15581995 215 370 
20 50 90 130 370 


All the totals required in the analysis of linear trend may be obtained from 
the Хуу column of part i of Table 7.6-3. Hence parts ii to iv are not 
actually required. In practice, however, part ii should be computed to 
serve as a check on the Хуу column. Additional checks are provided by 


parts iii and iv. 
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Computational symbols defined in part ii of Table 7.6-1 are obtained in 
part i of Table 7.6-4. For the case in which there are four points in each 
profile, Xu; = 20. Each total C, is the sum of npq = 12 observations. 
Hence the denominator for symbol (1’) is 12(20). Since each total AC, "o 
the sum of лд = 6 observations, the denominator for the symbol (3^) is 
6(20). 

( ) Table 7.6-4 Analysis of Linear Trend—Numerical Example 


(17) = (370)?/12(20) = 570.42 
(2) = (172 + 28° + --- + 44? + 38%)/20 = 616.70 
(0 (3’) = (160? + 2102)/6(20) = 580.83 
(4) = (155? + 215%)/6(20) = 585.42 
(5) = (63° + 97? + 922 + 118%)/3(20) - 596.10 
Source of variation SS df} MS Ж 
Within subjects (linear) (2’) = 616.70 | 12 
C (linear) (1^) = 570.42 | 1 | 570.42 | 221.09* 
(ii) АС (linear) (3) – (1) = 10.41 | 1| 10.41] 4.03 
BC (linear) (4) —(1) = 15.00 | 1| 15.00] 5.81* 
ABC (linear) (5) -G)-@)+d)= 0.27 | 1] 027 
C x subj w. groups 
(linear) (2) – (5) = 2060 | 8| 2.58 


, The analysis of variance for the linear trend of the within-subject effects 
Is summarized in part ii of Table 7.6-4. For the basic data in part i of 
Table 7.6-3, there are npq = 12 subjects and r = 4 observations on each 
subject. Hence the total degrees of freedom for within-subject effects are 
npq(r — 1) = 36. Inthe analysis of variance of trend, these 36 degrees of 
freedom are partitioned as follows: 


— 22-3888 eee 

Within subjects 36  npq(r - 1) 
Within subjects (linear) 12 npq 
Within subjects (quadratic) 12 npq 
Within subjects (cubic) 12 npq 


The 12 degrees of freedom for the linear trend of the within-subject effects 
are analyzed in part ii of Table 7.6-4. 
A test on linear trend in the main effect of factor C has the form 


MS, 
F = ссн) 00, 


MSc x subj w. groups(lin) 
At the .05 level of significance, this test indicates that the best-fitting strai ght 
line to the profile of the C main effects has a slope which is significantly 
different from zero. The profile corresponding to the C main effects (in 
terms of treatment means) is shown in Fig. 7.3. 
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The profiles corresponding to the simple effects of factor C at levels b, 
and b, are also shown in this figure. Inspection of these profiles suggests 
that the best-fitting line to the profile for b; would have a different slope from 
the best-fitting line to the profile for b,, that is, that these lines would not be 


14 


12 — | = | lone 


0 
с, 
1 с; ES 4 


Figure 7.3 Profiles of BC interaction and C main effect. 


parallel. А test of the hypothesis that there is no difference in these slopes 
(no difference in linear trend) has the form 


Маъоз = 581. 


Е 
MSc x subj w. groups(lin) 
At the .05 level of significance, this test indicates that the linear trends of the 
BC profiles cannot be considered to be equal. Er 
Profiles corresponding to the AC interaction are shown in Fig. 7.4. In- 
Spection indicates that these profiles differ in shape but that the best-fitting 
straight lines might be parallel. The test of the latter hypothesis is given by 


MSaeviin) = 4.03. 


E 
MSc x subj w. groups(lin) 

The value does not exceed the critical value for a .05-level test. Hence the 

experimental evidence does not reject the hypothesis that thelinear trends are 

equal. Thus differences in shapes of these profiles, if there are statistically 

Significant differences, must be due to quadratic or higher-order trends. 
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A summary of the analysis of variance for the quadratic trend is given in 
Table 7.6-5 and Table 7.6-6. This analysis follows the same general pro- 
cedures as those used in the analysis of the linear trend; in this case the quad- 
ratic coefficients replace the linear coefficients as weights. The test of 
differences in the quadratic trends of the AC profiles is given by 


F = ——MSectquas) ув 36, 
MSc x subj w. groupsiquad) 
Hence the data tend to reject the hypothesis that there is no difference in the 
quadratic trends of the AC profiles. Inspection of Fig. 7.4 indicates that 
the profile at a, clearly has a different curvature from the profile at a,. Тһе 


Figure 74 Profiles of AC interaction, 


significant F ratio for the main effect in Table 7,6-6 indicates that the quad- 
ratic trend in the profile of the C main effect is different from zero. 

In general, the analysis of differences in trends for interaction terms is not 
made unless there is evidence to show that some difference in shapes exists. 
This evidence is provided by the usual over-all tests on interactions, which 
indicate the presence or absence of global differences in the shapes of pro- 
files. The over-all analysis of variance for the within-subject effects is 
summarized in Table 7.6-8. This analysis is made by means of the usual 
computational procedures for a p х q x r factorial experiment having 
repeated measures on factor C. The basic observational data in part i of 
Table 7.6-3 and the summary tables at the left of parts ii to iv are used in the 
over-all analysis of variance. The significant AC and BC interactions 
indicate that there are global differences in the trends of the corresponding 
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Table 7.6-5 Analysis of Quadratic Trend—Numerical Example 


Quadratic coefficients 


(1) = (10)/12(4) 
(2) = (=3 + (—4 ce + 6 = $6.50 


© (3°) = [(—20/ + 307064) “ЗЫ 
- 2% 

4) = + тєл) 
59 Сір 4 (79) + MI e 1694) = 54.50 


52-07 
07 аз- 00 
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Table 7.6-7 Analysis of Cubic Trend—Numerical Example 


Source of variation SS df MS F 
Within subject (cubic) 20.30 12 
C (cubic) 0.42 1 0.42 
AC (cubic) 10.42 1 10.42 30.64* 
BC (cubic) 6.66 1 6.66 19.59* 
ABC (cubic) 0.07 1 0.07 
C х subj w. groups (cubic) | 2.73 8 0.34 


——————— Rue dd 
Table 7.6-8 Ovyer-all Analysis of Variance for Within Subjects 
Effects—Numerical Example 


Source of variation SS df MS ғ 
Within subjects 693.50 36 

С 572.92 3 190.97 180.16* 

AC 72.92 3 24.30 22.92* 

BC 22.00 3 7.33 6.92* 

ABC 0.33 3 0.11 

C x subj w. groups 25.34 24 1.06 


ML T. за S PETER И 


profiles. The nature of such differences is explored in the analysis of the 
linear, quadratic, and cubic trends. 

It is of interest to compare the over-all analysis with the analysis of the 
individual trends. It will be noted that: 


ee Ee 


55 df 
Within subjects (over-all) 693.50 36 
Within subjects (linear) 616.70 12 
Within subjects (quadratic) 56.51 12 
Within subjects (cubic) 20.30 12 
It will also be noted that: 
eee ЖҮЗ” Т-- 
SS 

AC (over-all) 72.92 

AC (linear) 10.41 

AC (quadratic) 52.09 

AC (cubic) 10.42 

Se ШЕ 


In each case the sum of the parts will be numerically equal to the corre- 
sponding over-all variation. 
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It is of particular interest to look at the parts of terms that go into the 
denominator of the F ratios. In this case: 


55 MS 
C x subj w. groups (over-all) 25.34 1.06 
C x subj w. groups (linear) 20.60 2.58 
C x subj w. groups (quadratic) 2.00 0.25 
C x subj w. groups (cubic) 2.73 0.34 


The parts of the over-all variation of this interaction need not be homo- 
geneous; each of these parts does not estimate the same source of variation. 
The parts measure the deviation of a weighted sum about the mean of the 
weighted sums; the weights are different for each of the parts. The expected 
values of the mean squares of the parts are to some extent dependent upon 
the weights; the latter in turn determine the shape of the curve that is being 
fitted to the experimental data. Different structural models underlie the 
analysis of variance for linear, quadratic, and cubic trends. 

Illustrative Applications. Grant (1956) gives a relatively complete 
account of tests for trends as well as a detailed numericalexample. The plan 
for the experiment discussed by Grant has the following form: 


CETTE NEST EUCCUFT INCUN Ho cut SS 


Stages of task 
Shock E 

сі C с C4 C5 
by Gu Gy Gy Gy Gu 
а by Сі біз Сі Gig 12 
b. 13 13 13 13 13 

Anxiety 3 parem 
level b, Са Сц Gy Ga Gn 
ds by Gs» Gyo С» Gry Gog 

» С, G. 
bs Gis Gog Gog °з 23 

Et Бы JETER E E ху = жаты 


Factor 4 represents the level of anxiety of the subjects as measured by the 
Taylor Manifest Anxiety Scale. Groups Gi; Су», and Gj, represent sub- 
jects from one end of the scale; groups Gs, Сз», and Gs; represent subjects 
from the other end of the scale. The three levels of factor B indicate the 
number of electric shocks received by a subject in the course of the experi- 
Ment. Subjects in groups б and С, were at level b,, subjects in groups 
Gis and С, at level by, and subjects in groups біз and Сз at level b. The 
levels of factor C represent successive stages of acard-sorting task. There 
Were four subjects in each group. The criterion was the square root of the 
number of perseverative errors. This plan may be considered as a special 
case ofa p x q x r factorial experiment with repeated measures on factor C. 
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The over-all analysis, as given by Grant, had the following form: 


SS df MS | Е 
Between subjects 23 
Groups 21,91 5 4,38 2.10 
Subj w. groups 37.56 18 2.09 
Within subjects 96 
C stages 16.37 | 4 4.09 5:378 
Stages x groups 35.26 20 1.76 2.31 
Stages x subj w. groups 54.89 72. 0.76 


In this analysis the six combinations of anxiety and shock are considered to 
be six levels of a single factor—a “group” factor. The group factor is then 
subdivided as follows: 


SS df | MS | F 
Groups 21.91 5 | " 
A anxiety level 1.75 1 1.75 0.84 
B shock 8.86 2 4.43 2.12 
AB 11.29 2 5.65 2.71 


The denominator for each of the above F ratios is IMS ow arouse? 

The 20 degrees of freedom for the stage x group interaction are associ- 
ated with what Grant calls the between-group trends. This interaction is 
subdivided into the following parts: 


SS df MS F. 

Stages x groups 35.26 20 
AC 17.58 4 4.40 5.79 
BC 10.62 8 1,33 1.75 
ABC 7.07 8 0.88 1.16 


The denominator for each of the above F ratios is MS stages x subj w. groups" 
These tests indicate whether or not there are any global differences in the 
shapes of corresponding profiles. 

The more specific analysis of differences in trends associated with factor 
С is summarized in Table 7.6-9. Іп each case the denominator for tests ОП 
differences in linear trend is the linear part ofthe C х subjects within groups 
interaction. Similarly, the quadratic part of this latter interaction serves a8 
the denominator for differences in quadratic trend. 
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It will be noted that, although the over-all test on the BC interaction indi- 
cates no significant difference, the test on differences in linear trends does 
indicate a statistically significant difference. In the absence of a significant 
over-all interaction the experimenter would not ordinarily make tests on 
the parts—unless a priori information about the underlying sources of 
variation in the experimental variables indicates that certain of the trends 


Table 7.6-9 Analysis of Variance of Grant Data (1956) 


Source of variation SS df MS F 
Within subjects 96 
AC 17.57 4 4.39 5.78* 
Linear 16.38 1 16.38 37.99* 
Quadratic 0.04 1 0.04 0.05 
Cubic 0.29 1 0.29 0.26 
Quartic 0.86 1 0.86 1.52 
BC 10.62 8 1.33 1.75 
Linear 6.84 2 3.42 7.93% 
Quadratic 1.00 2 0.50 0.55 
Cubic 2.29 2 1.14 1.00 
Quartic 0.49 2 0.25 0.44 
АВС 7.07 8 0.88 1.16 
Linear 2.05 2 1.02 2.37. 
Quadratic 1.48 2 0.74 0.81 
Cubic 0.24 2 0.12 0.10 
Quartic 3.30 2 1.65 2.93 
C x subj w. groups 54.89 72 0.76 
Linear 7.16 18 0.43 
Quadratic 16.43 18 0.91 
Cubic 20.55 18 1.14 
Quartic 10.15 18 0.56 
IE) uk Sn р М ФИН MATERNO ЫЕ Су сл 2-5. 


should be тоге dominant {һап others. However, oneshould not hesitate to 
present a complete description of the experimental findings, even though 
Some of the “tests” оп parts of nonsignificant over-all variation may be 
unduly subject to type I error. Loro MS d 
Another illustrative example of the analysis of trends is given in an experi- 
ment reported by Schrier (1958). The plan for this experiment may be 
Tepresented as shown on page 369. In this design there are repeated 
Measures on both factors A and B. The two levels of factor A represent 
two blocks of trials. The four levels of factor B represent four amounts of 
reward. The criterion measure was the proportion of correct discrimina- 
tions. To obtain homogeneity of variance, an arcsine transformation on 


the proportions was used in the analysis. 
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Table 7.6-10 Analysis of Variance of Linear Trend 


Linear coefficients: и;, 45, ... uj... Ug 
Xim = Хи Хуш АВ; - > Xim 
Pm =E Xim - EP. =} AB; 
n m i 
w= (лра) (3) = X(ABD*In(Zuj) 
Q^) = ха, (си) (4) = ХР, pE) 
Source of variation SS df 
B (linear) @) 1 
B x subjects (linear) (4) — (1) n-—-l 
AB (linear) (3) — (1) р 1 


АВ х subjects (linear) (2) – (3) — (4) + (1) | (л – 1)(р — 1) 


Table 7.6-11 Analysis of Variance for Schrier Data (1958) 


eee eee ees 


Between subjects 
Within subjects 


U 


V 
5 
Vs 
Vs 


Wy 
Ws 


Source of variation df MS F Denominator 
4 117.8 
ES 
Blocks 1 379.8 | 23.2* U 
Blocks x subjects 4 16.3 
Rewards 3 446.1 | 16.2* V 
Linear 1 1290.5 | 18.5* Vy 
Quadratic 1 1.5 Vo 
Cubic 1 46.1 | 5.7* Vs 
Rewards x subjects 12 27.6 
Linear 4 69.6 
Quadratic 4 4.6 
Cubic 4 8.2 
Blocks x rewards 3 41.2 | 62% Ww 
Linear 1 750 | 66 Wy 
Quadratic 1 2 Wa 
Cubic 1 48.6 | 16.6* Ws 
Blocks x rewards x subjects 12 6.6 
Linear 4 11.5 
Quadratic 4 5.6 
Cubic 4 3.0 
ee шш ше тел Sl al ae — —— 
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Trial: а аз 
Subjects 
Reward level: bi bs bs b, bi by 8 by 
1 
1 | 
3 


Since there are repeated measures on two factors, computational pro- 
cedures for this design differ somewhat from those given in Table 7.6-1. 
General procedures for the analysis of linear trends with respect to factor 
B are outlined in Table 7.6-10. If the logic of the design permits, a 
trend analysis may also be carried out with respect to factor 4. (In the 
case of the Schrier plan factor A has only two levels. Higher-order 
trends may be analyzed by following the general pattern in Table 7.6-10, 
replacing the linear coefficients with appropriate higher-order coefficients 
throughout. 

The analysis of variance reported by Schrier is summarized in Table 
7.6-11. Denominators for F ratios are indicated in the right-hand column. 
For example, the denominator used in the test for differences in cubic trend 
of the blocks x rewards interaction is the entry in row Ws. The latter 
is MShiocks x rewards x subj(oubieyr The experimental data indicate that the 
differences in cubic trend in the block x reward interaction cannot be con- 
Sidered to be zero; i.e., the shapes of corresponding profiles do not have 
equal cubic curvature. 


7.7 Testing Equality and Symmetry of Covariance Matrices 


Consider the following p x q factorial experiment in which there are 
Tepeated measures on factor B. 


bi by 7. b, 
а б, б, oa б, 
аз С Cian чы б; 
а, с, с, fuv G, 


Assume that there are n, subjects in the group (С) assigned to level a;, nz 
Subjects in the group (G;) of subjects assigned to level as, etc. Let 
Мет +m n, 
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The following additional notation will be used in this section: 


5, = q X q matrix of covariances for level а). 


5, =q X q matrix of covariances for level ay. 


5, =q X q matrix of covariances for level a,. 


Spootea = 4 X 4 matrix of pooled covariances; i.e., 
each entry is a weighted average of 
corresponding entries in S, through 5,, 
the weights being the corresponding 
degrees of freedom. 


The variables included in the covariance matrix at level a; are assumed to 
have an underlying q-variate normal distribution. 

The test procedures to be described in this section are those suggested 
by Box (1950). To test the hypothesis that the covariance matrices 
S, $,...,.$, are random samples for populations in which the 
covariance matrices аге X, = X, = ·.. = 2, = У (that is, that the 
population covariance matrices are equal), one computes the following 
statistics: 


(1) М, — N In |Spooteal = En, In 18,1, 
(2) едік 0451534 25 1, ЕР A 1), 

6(q + (p — 1) L An, N. 
(3) Ж 49+ Ip = 1) 


2 


Under the hypothesis that the multivariate normal populations have equal 
covariance matrices, the statistic 


(4) 4=(1—C)M, 


has a sampling distribution which is approximated by a chi-square dis- 
tribution having f, degrees of freedom. Rejection of this hypothesis 
rules against pooling covariance matrices. If the populations have а 
common covariance matrix E, then Spota is an unbiased estimate of X. 
This test procedure is a multivariate analogue of Bartlett's test for homo- 
Rec of variance. Its power is adequate only if each n, is large relative 
0 4. 

The model under which the usual F tests in a repeated-measure factorial 
experiment are valid not only assumes that the matrix of covariances within 
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each of the populations is Х (the same for each of the populations) but also 
that X has the following form: 


c? pè +++ рай 

p? oF +--+ ро? 
ЕН Ж А 

ГОА „ДА 


That is, each entry оп the main diagonal is equal to o°, and each entry off 
the main diagonal is equal to po?. If, in fact, X has this form, then the 
matrix 


var COV =, COV 

cov var >... cov 
So = è 2 „= 

cov coy --:* var_| 


where Var = mean of entries on main diagonal of Spootea, 
cov = mean of entries off main diagonal of Syootca, 
provides an unbiased estimate of X. 
To test the hypothesis that X has the form given above, one computes the 
following statistics: 


© M, = —(N — p) In rented, 
ISl 
0%24 — 3) 
6) C qq + 2 
2 ON = pg — 000 +4 — 4 
24-4 
(7) epik A STA 
2 
Under the hypothesis that X has the specified form, the statistic 
(8) T5 = (1 — С.М; 


has a sampling distribution which can be approximated by a chi-square 
distribution having f, degrees of freedom. TM 

The computation of the statistics defined in (4) and (8) will be illustrated 
through use of the numerical data given in Table 1.1-1. In this table, 
Р =2, q = 3, and п, =n, = 5. The covariance matrices obtained from 
the data in part i of this table are given at the left in Table 7.7-2. j For 
example, the entry 1.75 in the covariance matrix at level a; is the covariance 
between the observations in columns b, and b; at level ay. (The symbols 
in part iii of Table 7.7-1 are defined in Table 7.2-3.) 
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Table 7.7-1 Numerical Example 
————Á— — E i a 
Subject bi b, bs Total 
1 4 7 2 13 
2 3 5 1 9 
а 3 7 9 6 22 
4 6 6 2 14 
5 5 5 1 11 
1 25 32 12 69 
@ 
6 8 2 5 15 
4j 4 1 1 6 
а 8 6 3 4 13 
9 9 5 2 16 
10 7 1 1 9 
34 12 13 59 128 =G 
———— N a тыны 
b, by by 
Gi) a 25 32 12 69 
а 34 12 13 59 
59 44 25 128 
ee, 2—2. 275. ЫЕ 
(2a,) = 397 (2а) = 333 
(iii) (3a,) = 317.40 (3а) = 232.07 
(5a,) = 358.60 (Say) = 293.80 
(бау) = 350.33 (6a) = 255.67 
с лл ie c. s 
SSB subj w.a, = (2a,) - (5ау) — (бау) + (За) = 5.47 
B x subj w. ag = (2а) - (Sag) - (баз) + (Заз) = 15.60 
ў SS (pooled) — 
iv B xsubj (P 21.07 
6%) MS x subj w. a, = 5.47/8 = 0.684 
В x subj W. dy = 15.60/8 = 1.950 


MS p x subj (pooled) = 21.04/16 — 1.315 


елле EE ee >, 


Table 7.7-2 Covariance Matrices Associated with Data in Table 7.7-1 


ау аз Pooled 
в by b DES Do Dr Бу 10% 
Ы | 250 1.75 250 bı | 3.70 210 115 b, | 3.10 192 182 
by | 1.75 2.80 330 ba | 2.10 2.80 070 ba | 1.92 2.80 200 
bs | 2.50 3.30 4.30 ba | 115 0.70 3.30 by | 1.82 2.00 3.80 


Var — соу = 0.68 


Var — соу = 1.95 


var — cov = 1.32 
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The matrix corresponding to S\ooiea is given at the right of Table 7.7-2. 
The entry 3.10 is the mean of corresponding entries at levels a; and a», thatis, 
(2.50 + 3.70)/2 = 3.10. Similarly, the entry 1.92 is the mean of corre- 
sponding entries at a, and а. The numerical values of the determinants 
corresponding to S;, Sy, and Spooiea are 


|S,| = 108, |9] = 17.51, — |Syooteal = 11.28. 
The statistic defined in (4) is obtained from 


M, = 10 In (11.28) — 5 In (1.08) — 5 In (17.51) 
= 10(2.421) - 5(0.077) - 5(2.863) 


= 9,510, 
Ar E i; E i: 1) та 
koc 30%) =6. 
Hence ү = (1 — .325)(9.510) = 6.42. 


If = .05, the critical value for the test of homogeneity of population co- 
variances matrices is 7°,(6) = 12.6. Since the observed chi-square statistic 
does not exceed the latter value, the hypothesis of homogeneity of соуагі- 
ances may be considered tenable. An unbiased estimate of X is given by 


pooled” 
The matrix S, is given by 


3:23: 191 1:91 
$ = IE 323 1.91|. 

1.91 1.91 323 

Note that 
var — cov = 3.23 — 1.91 = 1.32. 
Within rounding error, this is the value of MS; x subj computed in part iv 
of Table 7.7-1. The numerical value of the determinant corresponding 
to S, is 
ISo] = 12.30. 


To obtain the statistic defined in (8), 


11.28 
шз Л зү б ЖЕШ бун 1692; 
M, = —(10 — 3) 1n 50 


— AB) _ 9.47. 
* 6(8)(2)(8) 


oA 
P 


Hence 18 = (1 — :047)(.692) = .659. 
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The critical value for a .05-level test on the hypothesis that all the diagonal 
values of X аге о? and all the off-diagonal entries are рО (3) = 7:8; 
These data do not contradict this hypothesis. 

It is of interest to note the following relationships: 


MSsupjw. a, = 8.23, 
Var, +(q — 1) соу, = 3.20 + (2)(2.52) = 8.24; 
MSsubj w. a, = 5.90, 


Vata, + (4 — 1) cov,, = 3.27 + (2)(1.32) = 5.91. 
In general, within rounding error, 
MBSsubj w. a, = var, + (9 — 1) соу, . 


7.8 Unequal Group Size 
Consider the following design: 


с С 
Group 
by ba ы 5 size 
а б, G, б, С, п 
аз Gz б, с, б, т 
ap Gy с, с, с, Ny 


Total number of subjects = № 


If the levels of factor A represent different strata within a specified popula- 
tion, then the л, may be proportional to the number of individuals actually 
in each of these strata in the population. In this case, a least-squares 
solution for the effects and the sums of Squares is appropriate. However, 
if the original plan for an experiment calls for equal group size, but the 
completed experiment does not have equal group size because of conditions 
unrelated to the treatments per se, then an unweighted-means solution is 
the more appropriate. Both types of solution are considered in this 
section. 

Because the cell frequencies will be proportional by columns, the least- 
squares solution can be obtained quite simply. Computational symbols 
for this case are defined in Table 7.8-1. The sums of squares are obtained 
by the relations given in Table 7.3-6, using the symbols as defined in Table 
7.8-1. Тһе degrees of freedom for the sums of squares are obtained from 
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Table 7.8-1 Unequal Group Size—Least-squares Solution 


= = У[(АВ„)°[п; 
(0. Nor Oss 
(2) = =X? (7) = т 
88%) 
(3) = (43 /n,) @) = 5208 
qr 
УВ} yf ABCs)? 
@ = xe "3 А as ] 
.xd E 
(5) = 74 (10) - F 


those given in Table 7.3-1 by replacing np with N throughout, where 
М = Хп. For example, 


рп = 1) becomes N~p, 
p(n — 1)(9 —1) becomes (№ — pYq — |). 


The starting point for an unweighted-means solution is an ABC' sum- 
mary table in which a cell entry is a mean; that is, 


АВС, 
АВС, = ттері 


From this summary table, 4B’ and АС” summaries are computed in the 


usual manner; that is, 
раса АВС, 
k 


А; = УАВ). 
2 


Computational symbols appropriate for this case аге given in Table 7.8-2. 
Only those sums of squares which do not involve the subject factor are 


Table 7.8-2 Unequal Group Size—Unweighted-means Solution 


(1) = С?/рдг (6) = EAB; Mr 


© 3) = (E40) (T) = EACH д 
4) сви (8) = BXBCI p 

(5) = Elpa (9) = Хабл) 

SS, = a3) — (171 55, = #06) — (3) — (4) + 07 

6) SS =m) — SSe =A) - G) = 60 +09) 
SS, = (5) ()) _ 55, = (8) — 4) — GO + (D) 


SSa, = #109) — (6) — (7) — (8) + (3) + 4) + (5) — 15] 
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Table 7.8-3 Numerical Example 


— eee 


Subject bi bs bs Total 
1 3 6 9 18 

а, 2 6 10 14 30 n = 3 
3 10 15 18 43 
4 8 12 16 36 
5 3 5 8 16 

а» 6 1 3 8 12 ng = 5 
P 12 18 26 56 
8 9 10 18 37 
9 10 22 16 48 

10 3 15 8 26 EU: 

аз 11 7 16 10 33 i 

12 5v 420 12 37 

77 4527 163 392 № = 12 


E e O a ж 


given here. Sums of squares which do inyolve the subject factor are 
identical with those in a least-squares analysis. 
In a least-squares solution, 


SStotat = SSetween cels + SSyithin ces: 


This relationship, however, does not hold for an unweighted-means solu- 
tion. If the л, do not differ markedly, both types of solution lead to 
numerically similar final products. 


Table 7.8-4 Computational Procedures 


AB summary table AB’ summary table 
b, b, by Total bi by bs Total 
жа а exile vie cy а, 6.33 1033 13.67 30.33 
@ а | 33 48 76 157 аз 6.60 9.60 15.20 31.40 
аз | 25 73 46 144 аз 625 18.25 11.50 36.00 
77 152 163 392 19.18 3818 4037 97.73 
ee ar ЭТ 
(1) = GIN — 4268.44 (1) = G?|pg = 1061.24 
ta  Qexxs = 5504 A 
б) (3) = X(42/n,g) - 429138 (3% = (ZAj9)g = 1067.29 
(4) -(EB)JN — 4633.50 (4) = (ЕВ?)/р = 1151.77 
(5) = =[(4B,;)?/n,] = 4852.30 (5) = =(4B!,)? = 1204.78 
(6) = (2P2)/q = 4904.00 
I c | a ы DR ч 
(іі) iy = <P 2 


^ Xm) B dp d ~ 3.830 
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Table 7.8-5 Analysis of Variance for Numerical Example— 
Unweighted-means Solution 


Source of variation Computational formula SS df | MS F 
Betusen subjects п 

T IG) — (17 23.17 | 2| 1129 

Subjects w. groups (6) — (3) 612.62 | 9 | 68.06 
Within subjects | n 

B T" AG?) — (17)] 346.73 2 | 173.36 | 79.89 

АВ (5) — (3) — (4) + 171] | 179.86 | 4| 4497 | 20.72 

В х subjects w. 

groups (2) — (5) — (6) + (3) 39.08 | 18 2.17 

Миы сз” | үсү кыы ыы eee o 


The computational procedures that have been discussed above can be 
specialized to the case of a p x q factorial experiment with repeated 
measures on factor B. This is done by setting r = 1 and dropping all 
terms involving factor C. The starting point for an unweighted-means 
analysis in this case is a summary table in which 


A numerical example of this case is given in Table 7.8-3. 

A summary of the computational steps in obtaining 
squares and the unweighted-means solutions appears in Table 7.8-4. 
Symbols associated with the least-squares solution appear at the left. The 
unweighted means analysis is summarized in Table 7.8-5. The two 
solutions are compared in Table 7.8-6. 

Individual comparisons for the case ofapxq factorial experiment 
having repeated measures on factor B will be outlined in what follows. 


For the case of the least-squares solution, 


both the least- 


ы A; 
Т! 


i 


т ҮН; 
апа В, = 


Table 7.8-6 Comparison of Solutions 


Unweighted Least 
Source Keans squares 
23.17 22.94 


A 
B 346.73 365.06 
AB 179.86 195.86 
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F ratios in tests on individual comparisons take the form 


Jm 4.) 
к= (А, = ) —. 
MS iiv w. groupe (= p: zz) 
F (В, ан By 


MS; x subj w. groups(2/N) 
For the case of the unweighted-means solution, 


A; 3 i 
A;=— ad = 5 


F ratios in tests on individual comparisons take the form 
CEED 
MS ( Da ) қ 
‘subj w. groups "4 тк "4 
Pa Т ААЙ 
MS, X subj w. groups (2/5, p) 


F= 


CHAPTER 8 


Factorial Experiments in Which Some of the 
Interactions Are Confounded 


8.1 General Purpose 


Precision of estimation requires that treatment effects be free of between- 
block variation. In this context, a block is a person, a group of people, a 
period of time, a source of experimental material, etc. Since the number of 
treatment combinations in a factorial experiment increases rapidly with 
either the number of factors or the number of levels of the factors, observing 
all treatment combinations within the same block often demands unwork- 
ably large block capacity. In this chapter, techniques will be considered for 
assigning a relatively small number of the-possible treatment combinations 
to blocks ina way that permits within-block estimates of the most important 
sources of variation. à 

Small block size in this context is equivalent to homogeneity of the con- 
ditions under which the treatment effects are measured. There can be con- 
Siderable variation among blocks; this source of variation does not affect 
the precision of the within-block information. When practical working 
conditions rule against having a complete replication within a single block 
and still assure homogeneity of the uncontrolled sources of error, then 
balanced incomplete-block designs provide the next best alternative. › 

Most of the plans to be developed use all the treatment combinations 
required for the complete factorial experiment. Within any one block only 
a fraction of all possible treatment combinations appear; the number of 
treatment combinations per block will be called the block size. The primary 
object of these plans is to control experimental error (1) by keeping the 
block size small and (2) by eliminating block differences from experimental 
error. The cost of this added control on experimental error will be the loss 
of some information on higher-order interactions. Differences between 
blocks will form a part of some interaction; hence components of such inter- 
actions will be confounded with the block effects. Whether or not the 
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sacrifice of information on components of higher-order interactions is worth 
the added control depends upon the magnitude of the block effects. 

Admittedly there is some danger in using designs in which any effect is 
confounded. This is particularly true in exploratory studies. However, 
the potential advantage of these designs—increased precision with respect 
to effects of primary interest—provides the experimenter with a potent 
source of motivation for their use. 

In this chapter the general principles underlying designs involving con- 
founding of interactions will be considered; the analysis and applications of 
these designs will also be illustrated. In balanced designs some informa- 
tion will be available on all components of the interactions; in other designs 
some of the components of interactions will be completely confounded with 
between-block effects. The designs to be discussed in this chapter resemble 
those to be discussed in Chaps. 9 and 10. The designs in Chap. 10, which 
use the Latin-square principle, are actually special cases of the designs in 
this chapter. The incomplete-block designs presented in Chap. 9 are 
single-factor experiments rather than factorial experiments ; however, quasi 
factors are introduced for purposes of the analysis. 

Construction of the designs to be presented in this chapter depends upon 
techniques for analyzing interaction terms into parts. The first step is to 
divide the treatment combinations into sets which are balanced with respect 
to the main effects of the factors involved. For example, in a 2 x 2 fac- 


torial experiment the following sets are balanced with respect to the main 
effects: 


Set I Set II 
aby, abis 
Gb»; abs, 


Note that a; appears once and only once in each set. Similarly, а, b, and 
b, each appear once and only once ineachset. Thus, there is balance with 
respect to the main effects of both factors 4 and B. Any difference between 
the sums for the sets is not a function of the main effects, but rather a function 
of the AB interaction. These results follow only if factors A and B are 
fixed; blocks are assumed to be random. Further, the underlying model is 
strictly additive with respect to block effects; i.e., no interactions with 
block effects appear in the model. 


Хик = + e + B; + abis + (block), + Em(ikm)+ 


In obtaining the components of a three-factor interaction one divides all 
treatment combinations into balanced sets. In this case, however, the 
balance is with respect to all two-factor interactions as well as all main 
effects. For example, in a 2 x 2 x 2 factorial experiment balanced sets 
with respect to the ABC interaction are: 
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Set I Set II 
абсуц abc 
абсуә» абсәуү 
абсзу» abCioy 
аса аБсэзз 


Note that each level of each factor occurs an equal number of times within 
aset. Note also that all possible pairs of treatments occur once and only 
once with each set. Hence each set is balanced with respect to the A, B, 
and C main effects as well as AB, AC, and BC interactions. Therefore, the 
difference between the sum of all observations in set I and the sum of all 
observations in set II is a function of the ABC interaction. 

Procedures for constructing balanced sets of treatment combinations are 
given in later sections. Comparisons between the sums over balanced sets 
define components of the interactions. 

The sets of treatment combinations which are balanced with respect to 
main effects and lower-order interactions will be assigned to separate 
blocks. Hence the between-block differences, though free of main effects 
and lower-order interactions, will be confounded with components of the 
higher-order interactions. By replicating the experiment an appropriate 
number of times, different components of higher-order interactions can be 
used to form the sets of treatment combinations that make up the blocks. 
Hence some information on all components of the higher-order interactions 
will often be obtainable from the experiment as a whole. 


82 Modular Arithmetic 

The material to be presented in later sections is simplified through use of 
modular arithmetic. By definition an integer / modulus an integer m is the 
remainder obtained by dividing Zby m. For example, the integer 18 to the 
modulus 5 is 3, since the remainder when 18 is divided by 5is3. This result 
is usually written 


18 (mod 5) = 3 
and is read “18 modulo 5 is 3.” Other examples are: 
20 (mod 5) = 0, 
7(mod 5) = 2, 
3 (mod 5) = 3. 


Alternatively, 18 is said to be congruent to 3, modulus 5; 20 is said to be 
congruent to 0, modulus 5. Thus, all integers are congruent to one of the 


integers 0, 1, 2, 3, or 4, modulus 5. 
If the modulus 3 is used, all integers are congruent to 0,1, or 2. For the 


m 
odulus 3, 18 (mod 3) = 0, 


7 (mod 3) = 1, 
20 (mod 3) = 2. 
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For purposes of the work in the following sections, the moduli will be 
limited to prime numbers, i.e., numbers divisible by no number smaller than 
it except unity. For example, 1, 2, 3, 5, 7, 11, etc., are prime numbers. 
The operation of modular addition is shown in the following examples: 

2--1- 0 (mod 3), 

0 + 2 = 2 (mod 3), 

2 --2-— 1 (mod 3); 


2--2- 4 (mod 5), 

4 -- 4 = 3 (mod 5), 

1 4- 4 = 0 (mod 5). 
To add two integers, one obtains the ordinary sum, and then one expresses 
this sum in terms of the modulus. Forexample,4 + 4 = 8;8(mod 5) = 3. 
Hence, 4 + 4 = 3 (mod 5). Unless the modulus is understood from the 
context, it is written after the operation as (mod m). The operation of 
multiplication is illustrated by the following examples: 

2:2 = 1 (mod 3), 

2:0 = 0 (mod 3); 


4:2 = 3 (mod 5), 

3:3 — 4 (mod 5). 
The product of two numbers is formed as in ordinary multiplication; then 
the product is expressed in terms of the modulus. 


Algebraic equations may be solved in terms of a modular system. For 
example, by using the modulus 3, the equation 
2x = 1 (mod 3) 
hasthesolution x — 2. To obtain the solution, both sides of this equation 


are multiplied by a number which will make the coefficient of x equal to 
unity, modulus 3. Thus, 


2:2х = 2 (mod 3). 


Since 2-2 — 1 (mod 3), the last equation becomes x = 2. As another 
example, the equation 


4x — 3 (mod 5) 


has the solution x — 2. Toobtain this solution, both sides of the equation 
are multiplied by an integer that will make the coefficient of x equal to unity. 
If both sides of this equation are multiplied by 4, 


4: 4x = 4: 3 (mod 5). 
Expressing the respective products to the modulus 5, one has 
16x:2—:12 or x = 2 (mod 5). 
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Equations of the form ax, + bx, = c (mod m) may always be reduced to 
the form x, + dx, = k (mod m). For example, the equation 
Г 2x, + x, = 1 (mod 3) 
becomes, after multiplying both sides by 2, 
ху + 2x4 = 2 (mod 3). 
As another example, 
2x, + 4x, = 2 (mod 5) 
becomes, after multiplying both sides by 3, 
xı + 2x, = 1 (mod 5). 
The equation 2x, + 4x, = 2 (mod 5) and the equation x, + 2x, = 1 (mod 5) 
have the same roots. It may be verified that when x, = 0, x, = 3; hence 
one pair of roots for these equations is (0,3). Other roots are (1,0) and 
(2,2). То show that (2,2) is a root of the equation x, + 2x, = 1 (mod 5), 
Substituting x, — 2 and x, — 2 in this equation yields 
2 + 22) = 1 (mod 5). 
The numerical value of the left-hand side of the equation is 6, which to the 
modulus 5 is 1. 


8.3 Revised Notation for Factorial Experiments 

The introduction of a modified notation for the treatment combinations 
in a factorial experiment will permit more convenient application of modu- 
lar arithmetic. This revised notation is illustrated for the case ofa 3 x 3 x 2 
factorial experiment in Table 8.3-1. The three levels of factor A are 
designated by the subscripts 0, 1, and 2. The treatment combination con- 
sisting of level a, level by, and level c; is designated by the symbol (101). 


Table 8.3-1 Notation fora 3 x 3 x 2 Factorial Experiment 


Co сі y 

Б inap eed ы 80% 
(ооу (010 (020) | (00) (01) (021) 
2^ | Qo) (10) 020) | doD Q1) (ш) 
a | Qo) Qi) (20 | Qo) CID 02) 


The digit in the first position indicates the level of factor A, the digit in the 
Second position indicates the level of factor B, and the digit in the third 
Position indicates the level offactor C. Thus,the symbol (ijk)represents the 
treatment combination abc;;,. 

To illustrate how modular arithmetic may be used to define sets of treat- 
ment combinations, consider a 3 x 3 factorial experiment. All the treat- 
ment combinations in this experiment may be expressed in the form (ij) by 
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suitable choice of i and j. Let x, stand for the digit in the first position of 
this symbol, and let x, stand for the digit in the second position of this sym- 
bol. The relation x, + x, = 0 (mod 3) is satisfied by the symbols (00), 
(12), and (21). То show, for example, that the symbol (12) satisfies this 
relation, substituting x, = 1 and x, = 2yields 1 + 2 = 0 (mod 3). Equiv- 
alently, the relation x, + x, = 0 is said to define or generate the set of 
treatment combinations (00), (12), (21). By similar reasoning, the relation 


X, + Xa = 1 (mod 3) defines the set (01), (10), (22); 
and the relation 

Xy + Ха = 2 (mod 3) defines the set (02), (11), (20). 
Each of these sets is balanced with respect to main effects. 


8.4 Method for Obtaining the Components of Interactions 


The method of subdividing the degrees of freedom for interactions to be 
described in this section applies only to factorial experiments of the form 
p* р X'+ x p, where p is a prime number, Thus, this method applies to 
experiments of the form 2 x 2,2 x 2 x ЖАЙЫН O UX: 8, x3-x 3.283 
5хХ5,5х5х5,...; еіс. АЗ x 3 factorial experiment will be used 
to illustrate the method. The A x B interaction, which has four degrees 
of freedom, consists of an / component having two degrees of freedom 
and a J component having two degrees of freedom. The sums of squares 
for A x B may be partitioned as follows: 


SS df 
AxB 4 
ABU) 2 
АВ(Ј) 2 


The symbol A 8(/) denotes the /component ofthe A x Binteraction. Such 
components may have no meaning in terms of the levels of the factors; their 
purpose is merely to provide a convenient method for subdividing the 
degrees of freedom, 

, An alternative, but more convenient, notation for the components of the 
interaction uses the symbol AB for the J component and the symbol 487 
for the Z component. The rationale underlying this notation scheme will 
become clear when the computational procedure for these components is 
described. Thenine treatment combinations in a 3 x 3 factorial experiment 


may be divided into three nonoverlapping sets by means of the following 
relations: 


Xi + 5а = 0 (mod 3) х + ха = I (mod 3) ху + ху = 2 (mod 3) 
(00) (01) (02) 
(12) (10) (11) 


Q1) Q2) (20) 
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The digits in the symbols for the treatments satisfy the relations under which 
the symbols appear. No treatment combination appears in more than one 
. set; all treatment combinations of the 3 x 3 factorial experiment are 
included. Each of these sets is balanced with respect to the main effects of 
both factors (assuming that factors A and B are both fixed factors), 

To illustrate what is meant by balance in this context, assume that the 
following linear model holds: 


X, = u ++ By (Bu + м» 


where X,«, = 0, Xf, = 0, and (жб), = Хай) = 0. When A and Bare 
fixed factors, these restrictions on the parameters in the model follow from 
the definition of the effects. Ап observation made on a treatment combina- 
tion on the left-hand side of the following equations estimates the sum of the 
parameters indicated on the right-hand side, 


(00) = и + a + Bo + («бух + Foor 
(12) = и + а + By + (afha + е 
(21) = и + a + By + Da + ба. 


The sum of the observations on this set of treatments will not contain effects 
associated with either of the main effects, since a, + % + % = 0 and 
Po + By + By = 0. The sum of the observations in this set of treatments 
will, however, involve an interaction effect, since («уу + (2/01; + (2/)4 ® 0. 
Thus, the set (00), (12), (21), which is defined by the relation x, + X, = 0. 
(mod 3), is said to be balanced with respect to both main effects but not 
balanced with respect to the interaction effect. The sets defined by the 
relations x, + x = 1 and x, + Xa = 2 are also balanced with respect to 
both main effects but unbalanced with respect to the interaction effect. 

Differences between the sums of the —€— Lm sech S e e 
sets define two of the four degrees of от of the A x B inte р 
These two degrees of freedom аге associated with what is called the AB(J) 
component (or the AB component) ofthe A х Binteraction. The numeri- 
cal example in Table 8.4-1 illustrates the computation of this component of 
the interaction. The entries in the cells of part i of this table arcassumed to 
be the sums over four observations; i.c., the data in part i represent а sum- 
mary of a 3 x 3 factorial experiment in which there are four observations 
per cell. ! - 

The sum of squares for the over-all A х B interaction, which has four 
degrees of freedom, is computed in part ii. The sum of squares for the AB 
component, which has two degrees of freedom, is computed in part iii. 
The sum of all observations in the set of treatment combinations defined by 
the relation x, + x, = 0 is denoted by the symbol (48). Similarly the 
sum of all observations in the set defined by x, + X; = 1 is denoted by the 
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Table 8.4-1 Computation of Components of Interaction in 3 x 3 
Factorial Experiment Having Four Observations per Cell 
з RS шынары ыыы лы ils t M NN 

| bo b, bs 


ар | 45 20 20 85 = Ay 
(i) @ р 502 20 95 = A, (n = 4) 
dy | 20 $50 80 150-А, 
90 120 120 330-G 
By В, В, 
= Nae 
45? + 20% + +--+ 502 + 80? 85° + 95° + 150? 


S = 
Заха 4 12 
Gi) _ 90° + 120° + 120? , 330? 
12 72367 
= 633.33 
ee Кызыны. _ _ 
X +X» = xy +X, =1 Xx + % =2 
(00) = 45 (01) = 20 (02) = 20 
2 (12) = 20 (10) = 25 (11) = 50 
(iii) (21) = 50 (22) = 80 (20) = 20 
(AB), = 115 (AB), — 125 (АВ), = 90° 
115° + 125? + 902 330? 
GR Lux Tr iw з= 
12 36 
= 5417 
SESEO OSY C uM n 1 oos 
xp + 2ху = х + 2х = 1 ху +2%=2 
(00) = 45 (02) = 20 (01) = 20 
" (11) = 50 (10) = 25 (12) = 20 
(iv) (22) = 80 (21) = 50 (20) = 20 
(АВ?) = 175 (АВ?) = 95 (AB). = 60 
SS, = 115% + 95* + 60? — 330? 
АВ j 12 736 
= 579.17 


нансыз. 
БОДЫ (AB),. The sum of Squares corresponding to the AB component is 
given 
2 SS, = (AB) + (АВ) + (АВ? G? 
$ 3n 9n' 


where n is the number of observations in each of the cell entries in part i. 
For the data in part i, SS (в = 54.17. 

The АВ? component of the interaction is obtained from sets of treatment 
combinations defined by the following relations (modulus 3 is understood): 


Xy 2x59 = 0 х1 оха х + 2x, = 2 
(00) (02) PERG Ic 
(11) (10) d 


Q2) Q1) (20) 
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These sets are also balanced with respect to the main effects. Hence dif- 
ferences among the sums of the observations in these three sets will define a 
component of the over-all interaction—in this case the 4В” component. 
The numerical value of this component is computed in part iv of Table 8.4-1. 
The symbol (48°), in this part denotes the sum of all observations in the set 
of treatments defined by the relation x, + 2x, =0. The sum of squares 
for this component is given by 
(AB?) + (AB); + (АВ? G? 

3n ТЕ 


SS, 


The numerical value for this sum of squares is 579.17. Р 
The partition of the over-all interaction may be summarized as follows: 


Source 55 df 
AxB 633.33 4 
AB 54.17 2 
AB* 579.17 2 


уз Лаан наана. 
Within rounding error 584, в = SS45 + SSis: In this case the AB? 
component of the interaction is considerably larger than the A Bcomponent. 
Inspection of part i of Table 8.4-1 indicates that the large totals are on the 
diagonal running from upper left to lower right. The treatment totals on 
this diagonal are in the same set for the AB? component but in different sets 
for the AB component. Hence the AB? component is large relative to the 


AB component. 


Table 8.4-2 Simplified Computation of the Components of the 
A х B Interaction 


a 25 50. 


ай MS 

(4B), — a i RC Um - (AB?) 

(АВ), = ix. `-20--- 95 = (АВ?) 
5 


(AB), = 125——a,—25 0 720--- 60 = (ИВ?) 


The computation of the components of the A x B interaction may be 


Simplified by use of the procedure illustrated in Table 8.4-2. The cell entries 
in part i of this table are the same as those in part i of Table 8.4-1 , with rows 
4; and a, appearing twice. Totals of the form (A В); аге үй by ш 
ming the entries on diagonals running from upper right to lower left; totals 
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of the form (АВ?), are obtained by summing entries on diagonals running 
from upper left to lower right. From these totals the components of the 
interaction are computed by the same formulas as those used in Table 8.4-1. 
The treatment combinations falling along these diagonals actually form the 
balanced sets defined by modular relations. 

The use of modular arithmetic in defining balanced sets of treatment 
combinations is readily extended to 5 x 5 factorial experiments. In this 
latter type of experiment the 16 degrees of freedom for the A х Binteraction 
may be partitioned as follows. 


Source df 
AxB 16 
AB 4 
АВ? 4 
AB? 4 
AB! 4 


The balanced sets of treatment combinations from which the АВ? com- 
ponent of the interaction is obtained are defined by the following relations, 
all with respect to the modulus 5: 


хі +3ху=0 Сх + 3х = 1 xj + 3x%,=2 0 x, +3x,=3 x+ 3x =4 


(00) (02) (04) (01) (03) 
(13) (10) (12) (14) (11) 
(21) (23) (20) (22) (24) 
(34) (31) (33) (30) (32) 
(42) (44) (41) (43) (40) 


Each of these sets is balanced with Tespect to both main effects. The 
balanced sets defining the 484 component of the interaction are obtained 
from the relation x, + 4x, = i (modulus 5), where i = 0,..., 4. 

The computation of the components of the A x Binteractionina 5 x 5 
factorial experiment may be carried out by an extension of the method used 
in Table 8.4-2. This extension is illustrated by the data іп Table 8.4-3. In 
part i, the totals required for the computation of the 4B component appear 
at the left, and the totals required for the 4B* component appear at the 
right. In part ii, totals required for the АВ? and the 4B? components are 
obtained. The arrangement of the /'s in part ii is given by multiplying 
subscripts to b’s in part i by 2 and then reducing the resulting numbers 
to the modulus 5. By performing this operation, the sequence bo, by, В 
bs, by becomes the sequence Б, bz, b4, by, bs; the latter sequence appears in 
partii. The AB? component of the interaction is obtained from the totals 
in the lower right of part ii. Assuming one observation per cell, 


SS, = ÉS +10” + 65? + 80? + 1002 380° 
5 


174.00. 
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Table 8.4-3 Computation of Components of Interaction in a 5 x 5 
Factorial Experiment 


аз 20 i: “15 10 10. 


97 5--- 50 -(АВ% 


@ (АВ) -110--а-30 20 75 M 
Ere ore 


5/ 10 15 20  25--- 65 —(AB* 


(АВ), = 80——a, 25 VE 
(AB), — 55 "c LN 725 ^30--- 85 = (А8), 
(АВ), = 55——4; lii 15 15 `20---100 = (AB"), 
(АВ), = 80——a,4——20 15 10 10 `5--- 80 = (AB), 
380 380 

b tore Кы cb, % 

a А 25 155 

an ОЛО 20 HL g 

a 1% DA эз. 15 

ase 9 yw vie 

а ОО ARIOI 2081550710 
bai ВС 
(ii) (AB), = 95——a,——30 api qt 20 ert 65 = (АВ?, 
(AB), = 55——a,— 5 обн 20--- 70 = (АВ?) 
(AB), = 60 lig 20° 7307 715. 25--- 65 = (AB), 
(AB), = 90---а; ie d Қ 715 15--- 80 =(4B"), 
(АВ?) = КЕЗЕК” 5 15 10---100 -(АВ?, 


380 380 
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Other components are obtained in an analogous manner. The numerical 
values of the components of the interaction for the data in Table 8.4-3 are 


as follows: 
pat ohe le et tt „ЫШ 


Source SS df 
AxB 1136.00 16 
AB 414.00 4 
AB? 174.00 4 
AB? 254.00 4 
АВ" 294.00 4 
Rea Е 


Ina3 x 3 x 3 factorial experiment, the three-factor interaction may be 
partitioned into the following components: 


ate LAAN A ны 
Source df 
AXBXC 8 
ABC 2 
ABC? 2 
AB?C 2 
АВ?С? 2 

a SE 5. 


The sets of treatment combinations from which the 4B2C? component is 
computed are defined by the following relations: 


Жа + 2% + 2x, — 0 (mod 3) x, + 2x, + 2x, = 1 (mod 3) xı + 2x, + 2x, = 2 (mod 3) 


(000) (002) (001) 
(012) (011) (010) 
(021) (020) (022) 
(101) (100) (102) 
(110) (112) (111) 
(122) (121) (120) 
(202) (201) (200) 
(211) (210) (212) 
(220) (222) (221) 


The sum of the observations within each of these sets is balanced with respect 
to all the main effects and all the two-factor interactions. There is no 
balance for the three-factor interaction. Hence differences between sums 
over the sets form one of the components of the three-factor interaction, in 
this case the АВ?С? component. Treatment combinations from which 
the АС component is computed are defined by relations of the form 
Xi + 2x, + x3 = i (mod 3Y. 

Р The computation of the components of the three-factor interaction is 
illustrated in Table 8.4-4. In parti summary data for observations made at 
level Со are given. Summary data for levels су and c, appear in parts ii and 
ш, respectively. Totals Corresponding to the AB component for level co 
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appear at the left in part i. These totals also appear in part iv under the 
column headed c,. The totals at the left of parts ii and iii make up the 
columns c, and c, in partiv. The ABC component of the three-factor inter- 
action is computed from totals at the left of part iv; the ABC? component 
is obtained from the totals at the right of part iv. For example, if there are 


Table 8.4-4 Computation of the Components of a Three-Factor 
Interaction in a 3 x 3 x 3 Factorial Experiment 


(i) ap 13 ras 
(AB), = 263——a,—— 86. а 25 MM ace a = (AB), 


(AB), = 400——a,——173 106 `170---319 = = (AB*)y 


`71- - 431 = (AB), 


(AB), = 489---ау 126 


сі 
ГАРРЕТТ 
a, 127 158 06 
(ii) ape 80 (o On 


(AB), = 307——a,— 927 7139 7104---340 = (АВ? 


(АВ) - 158. 7106- --325 = (АВ?), 


(АВ), = 342——a,—— 80 109 7123-2373 = (AB), 


(iii) ооз 222000? 
(AB), = 1983 ——a,— 92,101. “60---215 = (АВ? 
(AB), = 313——a;——103 80 7*54- — 218 = (AB), 


(AB), -263—a—123. 52 109---281 -(АВ?, 
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Table 8.4-4 (cont.) 


Со Сі 
(AB), 49 389 313 
(АВ), sa a 
(iv) (ABC), = йде ша К АЫ = (ABC), 


(ABC), = 1059——(АВ„——4897° "389 | 313- - -1020 = (ABC?), 
(АВС), = 987—(AB),—— 400 342 763--- 915 = (АВС), 


2964 2964 
————— e — ПОСНИ ЫИ 
Cp e Cy 
(AB), 396 иж 
(AB) 437 7950 278 
9 СС 
(АВС), = 859---(АВЭ),-3197 737) 7281-—-1002 = (AB*C9), 


396° 7340: 7215---1025 -(АВ?СЭ, 


(АВ?С), = 1041---(АВЭ, 


(АВС), = 1058 —(4 В), —437 ИО Тоат (AB*C?), 


ғ observations in each cell ofa 3 x 3 x 3 factorial experiment, the ABC 
component is given by 


918° + 1059? + 9872 — 2964? 
55івс “ 


D» 27r 


In part v of the table, the entries under column c, are obtained from the 
totals at the right of parti. Тһе entries in columns c, and с, are obtained, 
respectively, from the totals to the right of parts iiandiii. Totals at the left 
of part v are used to obtain the АВ?С component of the three-factor inter- 
action; totals at the right are used to obtain the 4 "ep component. 

The treatment combinations that are summed to obtain the total (A B*C?), 
in part v may be shown to be those belonging to set x, + 2x, + 2x, = 0. 
Similarly the total (4 B'C?), is the sum of all observations on the treatment 
Set x, + 2x, + 2x3 = 1. Тһе scheme presented in Table 8.4-4 provides a 
convenient method for obtaining the sum over balanced sets. In general 
the symbol (A4B'C/), represents the sum of all treatment combinations 
satisfying the relation 


Xı + ix, + jx, = k (mod 3). 
Rather than using the scheme presented in Table 8.4-4, it is sometimes more 


convenient to form the sets of treatment combinations by means of their 
defining relations and obtain the sums directly from the sets. 
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In the notation system used by Yates as well as Cochran and Cox, the 
following components of the three-factor interaction in a 3 x 3 x 3 fac- 
torial experiment are equivalent: 
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Modular notation Yates notation 


ABC ABC(Z) 
ABC? ABC(Y) 
АВС АВС(Х) 
АВ?С? ABC(W) 


The modular-notation system is the more convenient and lends itself to 
generalization beyond the 3 x 3 x 3 factorial experiment. For example, 
in а 5 х 5 x 5 factorial experiment, the three-factor interaction may be 
partitioned into the following parts: 


Source df 
A Bux 64 
ABC 4 
ABC? 4 
ABC? 4 
ABC* 4 
AB*C 4 
АВС 4 


There аге 16 components in all, each having four degrees of freedom. The 
balanced sets used to obtain the 4B!C* component are defined by the rela- 
tions x, + 4x, + 4x3 — 0, 1, 2, 3, 4 (mod 5). For example, the sum 
(АВАС), is obtained from the treatment combinations belonging to the set 
defined by the relation x, + 4x + 4x, = 2; there will be 25 treatment 


combinations in this set. ( г 1 
The notation system for the components of the interaction is not unique 


unless the convention is adopted that the exponent of the first letter is always 
unity. For example, ina 3 x 3 factorial experiment АВ and AB? define 


the same component, since 
2x, + хз = i (mod 3) 
defines the same set of treatment combinations as 
2(2)x, + 2x, = 2i (mod 3), 
x, + 2x, = 2i (mod 3). 
Multiplying both sides of a defining relation by a constant is algebraically 


equivalent to raising the symbol for the corresponding component to a 
power equal to that constant. In this case theconstantis2. Hence 


(А?В)? = A'B? = AB, 


upon expressing the exponents modulo 3. 
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As another example, іп a 5 x 5 factorial experiment 4?" is equivalent 

to AB*. To show this, 

3x, + 2x, = і (mod 5). 
Multiplying both sides of this equation by 2, 

6x, + 4x, = 2i (mod 5), 

X, + 4x, = 2i (mod 5). 
Alternatively, А?В? = (493p?) = 49g! — дві 
8.5 Designs for 2 x 2 x 2 Factorial Experiments in Blocks 

of Size 4 


The eight treatment combinations in a 2 x 2 x 2 factorial experiment 
may be divided into two balanced sets in sucha way that all main effects and 
two-factor interactions are balanced within each set. The plan in Table 
8.5-1 provides such sets. The treatment combinations in block | satisfy the 


Table 8.5-1 2 x 2 x 2 Factorial Experiment 
in Blocks of Size 4 


Block 1 Block 2 
(000) (001) 
(O11) (010) 


(101) (100) 
(110) (111) 


relation x, + X» + Хз = 0 (mod 2), and those in block 2satisfy the relation 
Xı + Ха + Ха = 1 (mod 2). For ғ replications of this experiment the 
analysis takes the form given in Table 8.5-2. It is assumed that the blocks 
are random but that factors A, В, and Care fixed. It is also assumed that 
the treatment by block interactions is negligible relative to other effects in 
the experiment. 

The tests on main effects and two-factor interactions use the within- 
block residual in the denominator of Fratios. The test on the three-factor 
interaction uses the between-block residual as thedenominator. Thislatter 
test will generally be considerably less sensitive than tests which are based 
upon within-block information. The within-block residual in this design 
is the pooled interaction of the replications with the main effects and with 
the two-factor interactions, 

The plan in Table 8.5-1 does not provide any within-block information on 
the three-factor interaction, By way of contrast the plan in Table 8.5-3 
provides within-block information on the three-factor interaction as well as 
the two-factor interactions, A minimum of eight blocks is required for this 
latter design. In the first replication, AB is confounded with differences 
between blocks 1 and 2. However, within-block information on AB is 


| 
E 
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available from all other replications.. The treatment combinations in block 
1 satisfy the relation x, + x, = 0 (mod 2); the treatment combinations 
appearing in block 2 satisfy the relation x, + x, = 1 (mod 2). Since 


Table 8.5-2 Analysis of 2 x 2 x 2 Factorial Experiment in 
Blocks of Size 4 


Source of variation df. E(MS) 
Between blocks к= 
Replications ne 
Blocks within reps r 
ABC 1 92 + “біра + 1088 
Residual (between blocks) red оў + Aofiocks 
Within blocks 6r 
E" E o? + 4ro2 ~ 
B 1 оў + 405 
G 1 o + 405 
АВ 1 оё + 2гбаӊ 
AC 1 оў + 2104 
BC 1 оў + 2гоў, 
Residual (within block) 6r—6 | о? 


—————————— 


within-block information оп ABis available from six of the eight blocks, the 
relative within-block information on the AB interaction is 8 = 4. | 

To show that block 3, for example, provides within-block information on 
AB, two of the four treatment combinations in this block satisfy the relation 


Table 8.5-3 Balanced Design with Partial Within-block 
Information on All Interactions 
AB eo ee a eee 
Rep 1 Rep 2 Rep 3 Rep 4 


Block 1 | Block 2 | Block 3 | Block 4 | Block 5 | Block 6 | Block 7 Block 8 


(000) | (010) | (000) | (001) | (000) | (001) | (000) | (001) 
(001) | (011) (010) | (оп) | (O11) | (00) | (01D n 
(110) | (100 | (10) | (100 | (100 | (10) | (10) (20 
бап) | по) | ап) | аю | аш) | (10) | шо) | (11) 


AB AC BC ABC 
IEEE СТ ЛЮ ЕТ үл ү, 


X, + x, = 0 and hence belong to what will be called the set Jo, that is, 
(000) and (110). Тһе other two treatment combinations in block 3 belong 
to the set Л, that is, satisfy the relation x, + Xy = 1. The difference 
between the totals for the J, set and the J, set provides information on the 
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AB component, which is free of block effects. In addition to block 3, 
each of blocks 4 through 8 contains two treatment combinations belonging 
to set Jọ and two belonging to set J}. 

The AC interaction is completely confounded with the difference between 
blocks 3 and 4, but within-block information on AC is available from all 
other blocks. Hence the relative within-block information on AC is ford. 
In block 3, the treatment combinations satisfy the relation x, + x, =0 
(mod 2); in block 4 the treatment combinations satisfy the relation 
Xı + хз = l (mod 2). Blocks 7 and 8 in Table 8.5-3 are identical to blocks 
1 and 2 of the design in Table 8.5-1. In the latter design, no within-block 


Table 8.5-4 Analysis of Balanced Design 


Source of variation df E(MS) 

Between blocks 7 

Replications 3 

Blocks within reps 4 
АВ 1 
АС 1 
BC 1 
ABC 1 

Within blocks 24 
A 1 o + 1602 
B 1 o? + 1663 
С 1 02 + 1602 
(AB) 1. о? + (D803, 
(AC) 1 02 + (2802, 
(всу 1 в? + (863, 
(ABC) 1 oF + (4035, 
Residual 17 о? 


information is available on the ABC interaction; the design in Table 8.5-3 
provides 3 relative within-block information on ABC, such information 
being available from blocks 1 through 6. 

The analysis of the design in Table 8.5-3 takes the form given in Table 
8.5-4. Тһе symbol (АВ) indicates that only partial within-block informa- 
tion is available for AB. In the computation of (4B)', only information 
from blocks 3 through 8 is used. The AB comparison given by the dif- 
ference between block 1 and block 2 is confounded by differences between 
these blocks; this comparison represents one of the four degrees of freedom 
for blocks within replications. In computing (AC)’, only information from 
blocks 1, 2, and 5 through 8 is used. Similarly, (АВС) is based upon in- 
formation from blocks 1 through 6. The différence between the total for 


A, 
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block 7 and the total for block 8 gives rise to the between-block component 
of ABC. 

Each of the main effects is determined from data obtained from all four 
replications; the within-block information on the interactions is based upon 
data obtained in each case from three of the four replications. Hence there 
are three effective replications for the interactions, but four effective replica- 
tions for the main effects. The degrees of freedom for the within-block 
residual are made up of the following parts: 


Residual 

A x reps 

B x reps 

C x reps 
(AB) x reps 
(AC)' x reps 
(BC) x reps 
(ABC). x reps 


к. 
эәәәәәеіл 


Since each of the interactions has only three effective replications, their 
interaction with replications has only two degrees of freedom. 

The expected values for the mean squares given in Table 8.5-4 assume 
that there is one observation per cell. Suppose that each block represents 
à group of л subjects and that the groups are assigned at random to one of 
the eight blocks. Also suppose each subject within the groups is observed 
under each of the four treatment combinations in the block to which the 
group is assigned. Further assume that terms of the form оу аге eq ual to 
zero. Under these assumptions the analysis takes the form given in Table 
8.5-5. Іп this analysis the individual subject is considered to be the experi- 
Mental unit. Tests on main effects and interactions use the residual mean 
Squarein the denominator of the Ғтайо. Тһегеіѕ usually nointerestin a test 
on differences between blocks or the components of such differences. Іп 
Some cases between-block estimates of the interactions may be combined 
with the corresponding within-block estimates. Such pooled estimates are 
given by a weighted sum of the between-subject and within-subject com- 
ponents, the weights being the respective reciprocals of the between- and 
within-person residuals. F tests on such combined estimates require a 
weighted pooling of the respective residuals. : 

The cell totals for the design in Table 8.5-3 may be designated by the 
following notation: 


bo by 
со с со e 
a Xoo Xoo Xoo Xon 
41 100 Хо. no жы 
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Table 8.5-5 Analysis of Balanced Design with Repeated Measures 
——————— Rem n 


Source of variation df E(MS) 
Between subjects 8n —1 
Groups T 
Reps 3 
AB 1 o? + AGbeople + 2no2, 
AC 1 оў + AGpeople + 2no?,, 
BC 1 о? + 4opeople + 2708, 
ABC 1 92 + AGpeople + 1025; 
Subjects within groups 8(n — 1) 92 + 4Gpeople 
Within subjects 24n 
A 1 92 + 16ло? 
B 1 ор + 16поў 
с 1 оў + 16по% 
(AB) 1 | 9 + (2)8n02, 
(Acy 1 оў + (2)8n02,, 
(BC) 1 оў + (2)8no$,, 
(ABC) 1 a? + ()4л02,, 
Residual 24n —7 о? 


SS EE 


Each of these totals will be based upon four observations. Since all the 
treatment combinations do not appear in all the blocks, means based upon 
these totals will not be free of block effects. If, for example, the cell total 
for treatment combination (101) were free of block effects, it would estimate 


Xii. == 4u + 4а + 46 + 4y, + A(xB)i + ay) 
+ Ву) + 49у) 
+ (sum of all block effects). 


In this last expression, the sum of all block effects would be a constant (under 
the assumptions made); hence differences between any two Хз would be 
free of block effects. It is readily verified that the following quantity will 
estimate the sum of the parameters on the right-hand side of the last 
expression: 

xa, = A ^ + Су | (AB) + (4С) + с + (АВС), T, 


where (AB); = total ofall treatment combinations which satisfy the relation 
Xi + Xə = 1 (mod 2) and appear in blocks 3 through 8; similarly (AC)o = 
total of all treatment combinations which satisfy the relation x, + Xs = 0 
(mod 2) and appear in blocks 1,2,4 through 8. The quantity Xj; is called 
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an adjusted cell total. The adjusted cell mean for treatment combination 
(101) is 


For the general case, an adjusted cell total for the design in Table 8.5-3 
has the form 
A; + By + € (АВ); + (АС) + BOS +x 
4 } 3 
(ABC) 54% fa 3G 
3 4 


where (AB);,; = sum of all treatment combinations which satisfy the 
relation x, + x, = (i + j) (mod 2) and which appear in blocks providing 
within-block information on AB. Similarly (ABC);,;,, = sum of all 
treatment combinations which satisfy the relation x + xo + % = G@+j+ 
k) (mod 2) and which appear in blocks providing within-block information 
on ABC, ) 
Adjusted totals having the form X; are obtained from the relation 
Xis. = Xi. + Xin. 
Similarly, Xix. = Xii + Xii. 
Individual comparisons among the treatment combinations use the adjusted 
cell totals. The effective number of observations in the adjusted cell total 
Xj, is actually somewhat less than four since there is only 1 relative within- 
block information on the interaction effects. For most practical purposes 
a comparison between two adjusted cell totals is given by 


2 
Ж. (Xie = Хь) 


X; ik = 


+ 


, 


| LUE ied 
pees Хра)? 
16MSres 


A numerical example of the design given in Table 8.5-1 appears in Sec. 
8.7. Computational procedures are simplified by means of special pro- 
cedures for 2" factorial experiments given in the next section. 


8.6 Simplified Computational Procedures for 2” Factorial 
Experiments 

Computational procedures for 2 x 2, 2 x 2 x 2, 2 x 2 x 2 x 2, etc., 
factorial experiments may be simplified by use of the device employed in 
forming balanced sets. This simplified procedure was presented in a dif- 
ferent form in Sec. 6.12. Тһе simplified computational procedures will be 
illustrated by the numerical example given in Table 8.6-1. Cell totals in 
part i are the sum of five observations. The sums of squares obtained 
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by the procedures to be outlined are the unadjusted sums of squares if the 
experiment involves any confounding. 


Table 8.6-1 2 x 2 x 2 x 2 Factorial Experiment with Five 


Observations per Cell 
лл ee ____—_—____ 
by by 
со сі Co сі Total 
0) = 
ao 5 10 15 15 45 
a 10 20 20 5 55 9 
15 30 35 20 100 - 
—— ғы шамы АБК 
Treat 
comb: (000) (001) (010) (011) (100) (101) (110) (111)| Comparison 
Cell i 
VTL BANI) 10-7 So а Tre Шр ЖИШШ 100 
G E Rea ise Eom NT ЖС ET н 100 
А = = ut gi 22 Б d + | 10 
nB = ла vo р = — + і 10 
(ii) С LEE жаа. — + - [ 0 
АВ шы ын x 1 -20 
АС ee ж ы + -10 
ВС + = = + + — - + =30 
ABC EN Т Bey IER + —20 
Пат ara a U UM 2001 
u0)? (-10) 
554 —2—— = 2:50 p m 2: 
4 = s) 5540 50) 2.50 
(10)? (—30) 
22 ба = o 
(iii) В 508) 2 SS nc 5(8) 2230 
з (—20) 
SS; = 0 5 
С авс = ~y = 1000 


‚ The patterns of the algebraic signs in part її are determined as follows (the 
Хх refer to digits in the treatment combinations at the top of each column, 
all addition is modulo 2): 


б: All positive 
A: —ifx, =0 +ifx, = 1 
B: — if x, —0, +ifx, =1 
С ait x; —0, +ifx, =1 
АВ: + if x, + x, —0, —ifx,+x, =1 
AC: + ify +x, -0, —ifx,+x, -- 1 
ВС: иар -- 0) if xa + x, = 1 
АВС: — if x, + х + x — 0, + if х + x2 + x3 = 1 
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In general if the number of factors which are interacting is even, then the 
zero modular sum receives the positive sign; if the number of factors inter- 
acting is odd, then the zero modular sum receives a negative sign. In this 


' context main effects are classified with the interaction of an odd number of 


factors. 

The entry at the right of each row in part ii is a weighted sum of the cell 
totals at the top of each column, the weights being +1 as determined by the 
pattern of signs in each row. For example, the entry at the right of row A 
is given by 


-5 — 10 — 15 — 15 4- 10+ 20+ 20 4- 5 — 10. 


As another example, the entry at the right of row ABC is given by 


-5 + 10 + 15 — 15 + 10 — 20 — 20 + 5 = —20. 


The pattern of the signs in the last expression is that in row ABC. The 
weighted sum of the cell totals for row ABC actually corresponds to the 
difference (ABC), — (АВС), where (АВС), represents the sum of all obser- 
vations on treatment combinations which satisfy the relation 


ху + Ха + Ху = | (mod 2). 


Alternatively, the totals at the right of the rows represent the compari- 
Sons corresponding to the effects at the left of the rows. The general form 
ofa comparison (assuming the same number of observations in each cell 
total) is 

Comparison = cT; + 7+: + cT, Хс = 0, 


where the 77 represent cell totals. Тһе sum of squares corresponding to 
а comparison is given by 


(comparison) 
SScomparison = ТУЯК О TEN, 


where л is the number of observations summed to obtain the cell totals. 
The c’s in part ii аге either +1 or —1, depending upon the sign; for each of 
the rows Nc? — 8. The sums of squares for comparisons corresponding 
to main effects and interactions are computed in partiii. In this example, 
nis assumed to be equal to 5. - А 

The extension of these computational procedures to any factorial experi- 
ment of the form 2^ is direct. Опсе the pattern of signs corresponding to a 
Tow in partiiis determined, the weighted sum of the cell totals corresponding 
to a row gives the numerical value of the comparison. For any comparison 
determined in this manner, Uc? = 25, 
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In 2* factorial experiments a specialized notation is frequently used to 
designate the treatment combinations. This specialized notation for a 
2 x 2 x 2 factorial experiment is as follows: 


by | b, 

Co c Co сі 
а (1) с b bc 
ay a ac ab abc 


The relationship between the notation system that has been used so far in 
this chapter and the specialized notation system is as follows: 


Treatment | | | 
combination . .. | (000) | (010) (001) | (011) | (100) | (110) | (101) | (111) 

Specialized ЕЗ ІШЕ ЕД 
notation...... (1) b с | be a ab | ас | abe 


In the specialized notation system, the symbol for a treatment combination 
contains only those letters for which the factor is at levell. Conversely, ifa 
factor is at level 0 in a treatment combination, then the letter corresponding 
to that factor does not appear in the symbol. 

In terms of the latter notation system sign patterns for the comparisons 
may be determined by a relatively simple rule. In the sign pattern for an 
interaction of the form Y Y, a symbol receives a positive sign if it contains 
both of the letters X and Y or neither of these letters; a symbol receives à 
negative sign otherwise. In the sign pattern for an interaction of the form 
XYZ, a symbol receives a positive sign if it contains all three of the letters 
X, Y, and Z or just one of the three letters; a symbol receives a negative sign 
otherwise. For the general case, in the interaction of an even number of 
factors, a symbol receivesa positive sign if it contains an even number of the 
interacting factors (zero being considered an even number); otherwise it 
receives a negative sign. In the interaction of an odd number of factors, à 
symbol receives a positive sign if it contains an odd number of the inter- 
acting factors; otherwise the symbol receives a minus sign. А main effect 


may be considered included under the latter case. The following example 
illustrates the general case: 


Symbols: (1) b с be a ав ас abe 


A Ге cow oy Сұ cune rer, 
с - - + + - - + =p 
AC + + - = — ui T + 
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For the case of a 2* factorial experiment, the sign patterns for some of the 
comparisons are illustrated below: 


(1) а b ab c ac bc abc d ad bd abd cd acd bcd abcd 


A — c post lene) ae ed eae лене кшн + 
AB Жо-талтоуң Заа НН us wr Medo enter осн 
ABC ЖЕЛГЕ ИГЕ zd me 
ИВС |l — — op o с, tees ap casu rm dim sfr 


Given the pattern of signs corresponding to a comparison, the sum of 

squares for that comparison is readily obtained from the treatment totals. 
A systematic computing scheme for obtaining the comparisons corre- 

sponding to main effects and interactions is illustrated in Table8.6-2. This 


Table 8.6-2 Simplified Computational Procedures 


Effect: 
Treatment y 
БОЕО Total (1) (2) (3) Comparison 
(1) 5 15 50 100 G 
a 10 35 50 10 A 
b 15 30 10 10 B 
(i) ab 20 20 0 —20 AB 
с 10 5 20 0 (е 
ас 20 5 -10 -10 АС 
be 15 10 0 —30 BC 
abc 5 —10 —20 —20 ABC 
Dh 55 divpi ign Nar eres 
Upper half..| |... 100 110 100 
Lower Һа. . e 10 —10 —60 
(i) 
Odds..... 45 60 80 
Evens ..; 55 50 20 


scheme is quite useful when the number of factors is large. The basic 
data in Table 8.6-2 are obtained from part i of Table 8.6-1. The treatment 
combinations must be arranged in the order given in this table. (г а fourth 
factor were present, the order for the treatment combinations having this 
factor at level 1 would be d, ad, bd, abd, cd, acd, bed, abcd.) ] 
The entries in the total column are the cell totals obtained from part i of 
Table 8.6-1. The entries in the upper half of column 1 are the sums of 
Successive pairs of the entries in the total column, ie, 5 Ge 10 = 15, 
15 + 20 = 35,10 + 20 = 30,and15 + 5 = 20. The entries in the lower 
half of column 1 are differences between successive pairs in the total 
column, i.e., 10 — 5 = 5,20 — 15 = 5,20 — 10 = 10and5 — 15 = 10. 
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The entries in the upper half of column 2 are the sums of successive pairs 
of entries in column І, that is, 15 + 35 = 50, 30 + 20 = 50. The entries 
in the lower half of column 2 are the differences between successive pairs of 
entries in column 1, that is, 35 — 15 = 20, 20 — 30 — 10. 

The entries in column 3 are obtained from the entries in column 2 bya 
procedure analogous to that by which the entries in column 2 were obtained 
from column 1. This procedure is continued until column £ is reached, 
where k is the number of factors. In this example k = 3; hence the pro- 
cedure is terminated at column3. The entries in column 3 givethe numeri- 
cal values of the comparisons corresponding to the treatment effects to the 


left of each row. These values are identical to those obtained in part ii of 
Table 8.6-1. 


8.7 Numerical Example of 2 x 2 x 2 Factorial Experiment in 
Blocks of Size 4 


The purpose of this experiment was to evaluate the effects of various 
treatment combinations upon the Progress of mental patients in specified 

' diagnostic categories. The treatments and their levels are given in Table 
8.7-1. It was desired to have 10 patients in each cell of a factorial experi- 
ment, necessitating a total of 80 patients for the experiment. However, no 
more than 20 patients meeting specifications were available from a single 


Table 8.7-1 Definition of Factors 


Factor Level Definition 
Drug A do No drug 4 
а, Drug А administered 
b No drug B 
Drug B 0 5 
8 b, Drug B administered 
Psychotherapy с Мо Psychotherapy 


[71 Psychotherapy administered 
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hospital, but four hospitals were available for the study. It was anticipated 
that there would be large differences among hospitals. In order to prevent 
such differences from confounding main effects and two-factor interactions, 
the design in Table 8.7-2 was used. 


Table 8.7-2 Design and Data for Numerical Example 


Hospital 1 Hospital 2 Hospital 3 Hospital 4 
p р Р p 


a 6 | С, а 14 (ІШІ 3) 
b 10° 3 ab 4 b 15 ab 6 
c 6 ace AS С 9 ас 25 
abc 8 bc 18 abc 12 bc 22 

30 39 50 56 


XXX? = 2605 
„.——— 


In the construction of this design, the relation x, + Xs + Хз = i (mod 2) 
was used to divide the eight treatment combinations into blocks of size 4. 
By using the specialized notation, this procedure is equivalent to assigning 
symbols containing an even number of letters to one set of blocks and sym- 
bols containing an odd number of letters to a second set of blocks. (In 
terms of the design in Table 8.7-2, symbols having an even number of letters 
appear in blocks 2 and 4.) Hospitals were assigned at random to blocks. 
Hospitals 1 and 2 make up one replication; hospitals 3 and 4 make up the 
second replication. Within each replication, the ABC interaction 1s com- 
pletely confounded with between-hospital differences. From each hospital, 
20 patients meeting the specifications were selected and assigned at random 
to subgroups of 5 patients each. The subgroups were then assigned at 
random to one of the treatment conditions allocated to the hospital to which 
the subgroup belonged. 

Each patient was rated by a panel of judges before and after the treatment 
combinations were administered. The difference between these two ratings 
was taken as the criterion of progress. Since some of the treatments were 


administered to the subgroups as a unit, the subgroup of five patients was 


considered to be the experimental unit, rather than the individual patient. 
The mean criterion score for each subgroup is given in Table 8.7-2 to the 
right of the corresponding symbol for the treatment combination. Each of 
these means is considered to be a single observation for purposes of the 
analysis of variance. к m 

Summary tables and details of the computations are given in Table 8.7-3. 
Cell totals for the treatment combinations appear in part 1; these totals are 
obtained by combining the data from the two replications. Each of these 
cell totals is considered to be the sum of two observations. The numerical 
values for individual comparisons corresponding to the main effects and 
the interactions are computed in partii. Here the entries in the upper half 
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of column 1 are obtained from the total column by summing successive 
pairs of values in the total column; the entries in the lower half of column 1 
are obtained by taking the difference between successive pairs of values in 
the total column. Column 2 is obtained from column 1 by an analogous 
procedure, and similarly column 3 is obtained from column 2. Тһе entries 


Table 8.7-3 Summary of Numerical Analysis 


bo by 
Total 
Co с Co сі 
а 5 15 25 40 | 85 
() а 20 40 10 20 | 90 
25 55 35 60 | 175 
ee 
Treatment 
combinations | Total (1) (2) (3) 
(1) 5 25 60 175 G 
a 20 35 115 5 A — A, — Ao 
b 25 55 0 15 В = B, – By 
бі) ab 10 60 5 —75 АВ 
с 15 15 10 55 С=с, – С, 
ас 40 15 5 5 АС 
be 40 25 —30 -5 ВС 
abe 20 —20 —45 —15 АВС 


554 = (5)/16 = 157 554с = (5316 = 1.57 
SSp = (15)2/16 = 14.06 SSgc = (—5)}/16 = 1.57 

3 550 = (55/16 - 189.06 SSazc = (—15)?/16 = 14.06 
(i) 554; = (—75)2/16 = 351.57 
SShospitais = (30° + 39° + 502 + 562)/4 — (175)/16 = 100.19 

SStotar = 2605 — (175)2/16 = 690.94 


in column 3 are the values of the comparisons. The corresponding sums of 
Squares are given in part iii. The sum of squares for the main effect of 
factor A has the general definition 
e 
2%” 
where C, = comparison corresponding to main effect of factor A, 
К = number of factors, 

, T = number of replications. 
In this case C, is the entry in column 3 іп row а, whichis 5. For this case 
r = 2, and К = 3; hence 


5# 25 
себеи) Leeds зі 
и тат 


Saye 
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The analysis of variance for these data is given in Table 8.7-4. The 
within-hospital sum of squares is obtained by subtracting the between- 
hospital sum of squares from the total sum ofsquares. The within-hospital 
residual is obtained from the relation 


SSres = SSw. nospitat — (sum of main effects and two-factor interactions) 


= 590.75 — 559.40 = 31.35. 


Although the ABC interaction is completely confounded with between- 
hospital differences within a single replication, by including information 
from the two replications the ABC component may be estimated. 


Table 8.7-4 Analysis of Variance 


Source | SS df MS F. 
Between hospitals 100.19 3 
Replications 85.56 1 
Residual (b) 14.63 2 
Within hospitals 590.75 12 
А Drug А 1.57 1 1.57 == 
В Drug В 14.06 1 14.06 2.69 
С Psychotherapy 189.06 1 189.06 36.22** 
АВ 351.57 1 351.57 6123575 
АС 1.57 1 1.57 -- 
ВС 1.57 1 1.57 = 
Residual (w) 31.35 6 5.22 


к Residual wA ibe шш eee 
**F (1,6) = 13.74 


square is used as the denominator for 


all tests on within-hospital effects. The F tests show no significant inter- 
actions involving factor C (psychotherapy), but a significant main effect. 
It may be concluded from this information that the effect of psychotherapy 
is independent of the effect of the drugs; further, the groups given psycho- 
therapy showed significantly greater improvement than did groups which 
were not given psychotherapy. 

The interaction between the drugs is seen t 
following summary data obtained from part io 
interpretation of this interaction: 


The within-hospital residual mean 


o be highly significant. The 
f Table 8.7-3 are useful in the 


bo by 
Маз 

20 65 

а | 60 30 


both drugs simultaneously is not better 


These data indicate that the use of 1 
The test on the comparison between the 


than the use of either drug alone. 
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levels of drug B in the absence of drug 4 is given by 


(ABy — ABS? (20 — 65 
2nr М6) 2(4)(5.22) 


If this comparison is considered to belong in the a priori category, the 
critical value for a .05-level test is F,(1,6) = 5.99. If this comparison is 
considered to belong in the a posteriori category, the critical value for a 
-05-level test is, as given by Scheffé, 3F,,(3,6) = 3(4.76) = 14.28. (In the 
present context this test would be considered as being in the a priori cate- 
gory.) In either instance, it may be concluded that drug B has a significant 
effect upon progress in the absence of drug A. 
A test on the effect of drug А in the absence of drug B is given by 


F= (20 — 60)? 
2(4)(5.22) 


Clearly drug A has a significant effect upon progress in the absence of drug 
B. Tocompare the relative effectiveness of the two drugs when each is used 
in the absence of the other, 


_ (60 — 65)? a 
2(4)(5.22) 


Thus the data indicate that drug 4 and drug B are equally effective. 

The denominator of the F tests made in the last paragraph has the general 
form nr(Ec?)(MS,,.); nr is the number of experimental units summed to 
obtain the total in the comparison, Ус? is the sum of the squares of the co- 
efficients in the comparison, and MS,,,, is the appropriate error term for 
the comparison. 

In Table 8.7-4 the within-hospital residual sum of squares was obtained 
by subtraction. This term is actually the pooled interaction of treatment 


F 


48.49, 


= 38.31. 


0.60. 


Table 8.7-5 Computation of the Components of the Residual (w) 


Treat | Hosp. 1 | Hosp.3 |Total| Treat Hosp. 2 | Hosp. 4 | Total 
a 6 14 20 (1) 2 3 5 
| b 10 15 25 ab 4 6 10 
OT 4 6 9 15 ac 15 25 40 
abc | 8 te i12. 20 be 18 22 49. 
| 30 50 80 39 56ten 195 
o S aL 
Treat x hosp. interaction Treat x hosp. interaction 
(ii) = (6 + 10? + --- + 92 4 122) = (22 + 42 +... 4 252 + 222) 
ne с? л. +20°)/2 — (30? + 509/4] — (52 +... + 402)/2 — (39° 
+ 80°/8 = 7.00 + 56/4 + 952/8 = 24.37 


(iii) SS, = 7.00 + 24.37 = 31.37 
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effects (with the exception of ABC) with the hospitals receiving the same set 
of treatment combinations. Direct computation of the residual sum of 
squares is illustrated in Table 8.7-5. Since hospitals 1 and 3 had the same 
set of treatment conditions, the data at the left in part i provide three of the 
six degrees of freedom of the residual term. If the four treatment com- 
binations common to hospitals 1 and 3 are considered as four levels of a 
single factor, then the interaction of the levels of this factor with the hospital 
factor (defined by hospitals | and 3)is part of thesum of squares for residuals. 
This interaction is computed at the leftin partii. An analogous interaction 
term is obtained from the data at the right in part i. 


8.8 Numerical Example of 2 x 2 х 2 Factorial Experiment in 
Blocks of Size 4 (Repeated Measures) 

The purpose of this experiment was to evaluate the preferences for adver- 
tisements made up by varying the size (factor A), the style of type (factor B), 
and the color (factor C). The definitions of the levels of these three factors 
are given in Table 8.8-1. 


Table 8.8-1 Definition of Factors 


Factor Level | Definition 
Size (4) а Small 
a Large 
Style (B) bo Gothic 
ы Roman 
Color (C) со Green 
сі Blue 


Аан 


The experimenter desired to have within-subject estimates on all main 
effects and interactions; however, the task of having each subject judge all 
eight combinations was not considered to be experimentally feasible. Loss 
of interest on the part of the subjects and excessive time demands (as indi- 
cated by a pilot study) ruled against the procedure of having each subject 
judge all combinations. The experimenter was willing to sacrifice pre- 
cision with respect to the three-factor interaction 1n order to keep the 
number of judgments an individual had to make down to four. The plan 
outlined in Table 8.8-2 was chosen for use. nm 

In order to keep this illustrative example simple, it will be assumed that a 
sample of six subjects was used in the experiment. | (In practice, this sample 
size would be too small.) The subjects were divided at random into two 
groups. Individuals within each group judged only four of the eight 
different make-ups. The combinations of factors judged by group I 
satisfied the relation x, + Xy + Xs =! (mod 2), and the combinations 
judged by group II satisfied the relation ху + Xa Xa = 0 (mod 2) In 
terms of the specialized notation system for 2" factorial experiments, group 
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I judged treatment combinations represented by symbols having an odd 
number of letters; group II judged the remaining treatment combinations. 
In this design the ABC comparison is completely confounded with differ- 
ences between groups; within-subject estimates are available on all other 
factorial effects. 

Table 8.8-2 Outline of Plan and Basic Data 


Group I Group II 
Person| a 5 c abc| Total Person (1) ab ac bc Total 
1 16.58. 2 8 34 4 10012 i$. 3 33 
G) 20 ей б. 24 5 11 16 10 5 42 
3 н< Ай эг 17 6 a ТТ, 27 2 t 20 
Total| 35 15 5 20 75 Total 2571451 25 10 95 
Treatcomb.: a b c abe (1) ab ac be 
Comparison 
Cell totals: 35 15 5 20 25 35 25 10 
G so Tu йкы: ak ыл! + 170 
A Беи a |, 4... 60 
В аан [Гуз Stayt J- —10 
ау) € oeste en em ap de -50 
AB Se ee cb Et 0 
АС Pu c ЕЗЕТ 0 
BC а, ақ, ae) утар 10 
ABC ИЯ DP ERE ды атын —20 
55, = (60)/24 = 150.00 SS45 —0 5850 = (10)?/24 
th 55в = (—10)?/24 = 417 55200 - 4.17 
(iii) 550 = (—50)2/24 = 104.17 SS4nc = (—20)°/24 = 16.67 
55, = (34? + 24? + 172 + 33° 4 422 ү 20?)/4 — (170)?/24 = 114.33 
SStota = (16? + 8? + +++ + 72 25) — (170)2/24 = 409.83 


The order in which a Subject judged a particular combination was ran- 
domized independently for each subject. Order could, however, have been 
controlled by means of a Latin square. (Plan 10.7-7 utilizes the Latin- 
Square principle with what is essentially the design given in Table 8.8-2. 
The number of subjects Per group would have to be a multiple of 4 in order 
to use the Latin square.) 

The comparisons Corresponding to the main effects and interactions of 
the factors are obtained at the right in part ii of Table 8,8-2. The method 
by which the ABC comparison is obtained shows clearly that this compari- 
son is completely confounded with differences between the two groups. 
The sums of squares corresponding to the factorial effects, between-subject 
variation, and total variation are computed in part iii. The over-all 
analysis of variance is summarized in Table 8.8-3. 
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The sum of squares due to the within-subject variation is obtained from 
the relation 
SSw. subj = SStotar — SSsunj 
= 409.83 — 114.33 = 295.50. 


The residual sum of squares is given by 
SSres = SSw, subj — (sum of main effects and 2-factor interactions) 
= 295.50 — 262.51 = 32.99. 
Assuming that factors A, B, and C are fixed, the residual term is the proper 
denominator for all within-subject effects. The proper denominator for 


Table 8.8-3 Summary of Analysis 


Source of variation SS df MS F 
Between subjecis 21229 Е 
ABC (groups) 16.67 1 16.67 
Subjects within groups 97.67 4 24.42 
Within subjects 295.50 18 
A Size 150.00 1 150.00 54.54** 
В Type style 4.17 1 4.17 1.51 
с Color 104.17 1 104.17 37.88** 
AB 0 1 0 
AC 0 1 0 
BC 4.17 1 4.17 1.51 
Residual 32.99 12 2.75 


** F (1,12) = 9.33 


between-subject effects is the mean square for subjects within groups. In 
this design the ABC interaction is a between-subject effect—but this inter- 
action is completely confounded with differences among groups. Generally 
there would be little interest in testing the ABC interaction. (In this case 
the F ratio would be less than 1.) t ури 

For the within-subject data in Table 8.8-3, попе of the interactions is 
statistically significant. This result implies that the main effects (if any) 
Operate independently; i.e., the main effects are additive. The tests made 
in Table 8.8-3 indicate that the size and color main effects are statistically 
significant. Inspection of the summary data in Table 8.8-2 indicates that 
large size (a,) is preferred no matter which of the styles or colors is used. 
Similarly, green (cy) is preferred to blue no matter which size or style of type 
is used. There is no statistically significant difference between the two 
styles of type used in the experiment; there is, however, a slight preference 
for the Gothic (bọ). i 

If one were to make an over-all recommendation with respect to the most 
preferred make-up of the advertising copy, the large size in the green color 
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would be the best combination. The two type styles are, statistically, 
equally good, but there is a slight observed difference in favor of the Gothic. 

Returning to structural considerations underlying the analysis of this 
design, the residual term in the analysis consists of the following pooled 


interactions: 
Subject x treatments (within group I) 6 
Subject x treatments (within group П) 6 
Residual 12 


The residual term in Table 8.8-3 may be computed directly from the inter- 
actiontermsgivenabove. The four treatment combinations assigned to the 


Table 8.8-4 Analysis of General Case of Design in Table 8.8-2 
a > SR eee 
Source of variation df 


Between subjects 2nr —1 
Replications Pl 
Groups within replications r 

ABC 1 
Residual (groups) r—i 
Subjects within groups 2r(n — 1) 
Within subjects 6nr 


Residual (within subject) 6(nr — 1) 

c— тана зету с и 
subjects within group I are considered to be four levels of a single factor; 
the subject by treatment interaction is computed by means of the usual 
computational formulas for a two-factor interaction. 

The analysis of the general case of a2 x 2 2 factorial experiment in 
blocks of size 4 with repeated measures is outlined in Table 8.8-4. In this 
analysis there are л subjects in each group, and the experiment is replicated 
r times. The analysis of the design in Table 8.8-3 is a special case of the 
more general design in Table 8.8-4. In the special case n = 3, andr = 1. 
In a design having more than one replication, the ABC interaction may be 
tested by using the subjects within groups in the denominator of an F ratio. 
This test will generally be considerably less sensitive than tests on the main 
effects and two-factor interactions. 


8.9 Designs for 3 x 3 Factorial Experiments 


А The nine treatment combinations in this experiment may be partitioned 
into three sets in such a way that differences between sets form one of the 
components of the A x Binteraction. The design outlined in Table 8.9-1 
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gives two such partitions. In replication 1, the AB (or J) component is used 
to define the sets (blocks); in replication 2, the АВ? (or Г) component is used 
to define the sets (blocks). 

The treatment combinations in block 1 satisfy the relation x, + x = 0 
(mod 3); the treatment combinations in blocks 2 and 3 satisfy the respective 
relations x, + x, = 1 (mod 3) and x, + x, =2(mod 3). Hence differences 
between blocks within replication 1 are completely confounded with the AB 
(or J) component of A x B. That is, a difference between two block totals 
in replication 1 is simultaneously an estimate of block differences as well as 
differences which form part of the AB component. (A formal proof of this 
is given later in this section.) 

In replication 2, each of the blocks contains treatment combinations which 
satisfy the relation x, + 2x, = i (mod 3). For example, the treatment 


Table 8.9-1 3 x 3 Factorial Experiment in Blocks of Size 3 


Replication 1 Replication 2 


Block 1 Block 2 | Block 3 | Block 4 | Block 5 | Block 6 


(00) (о! ) (02) (00) (02) (01) 
(12) (10) (20) (11) (10) (12) 
(21) (22) (11) (22) (21) (20) 
Component 
confounded AB (or J) AB* (or I) 


combinations in block 5 all satisfy the relation x, + 2x, = 2 (mod 3). 
Hence differences between blocks within replication 2 are completely con- 
founded with the AB? (ог Г) component of A х B. € 

Assuming one observation per cell, the analysis takes the form given in 
Table 8.9-2. Within-block information on the main effects is available 
from both replications. Within-block information on the AB (or J) com- 
ponent is available only from replication 2; within-block information on 
the AB? (or Г) component is available only from replication I. Since 
within-block information on the components of A x B is available from 
only one of the two replications, the relative within-block information on 
A X B is said to be 3. For main effects, however, the relative within- 
block information is # = 1, since both replications provide within-block 
information on the main effects. The design in Table 8.9-1 is balanced in 
the sense that the same amount of relative within-block information is pro- 
vided on each of the components of 4 B. і 

The denominator of F ratios for within-block effects is the pooled inter- 
action of the main effects with the replications. The two components of the 
A х B interaction are generally combined, and the combined interaction 
(with four degrees of freedom) is testedasaunit. If, however, the individual 
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components are meaningful in terms of the experimental variables, separate 
tests on the components may be made. In most cases the components of 
A x B that are confounded with block effects are not tested. There are, 
however, techniques for combining the between-block information on the 
interaction with the within-block information to obtain an over-all estimate 
of the interaction. Such techniques are said to recover the between-block 
information on the interaction effects; these techniques will not be con- 
sidered in this section. 

The block in this design may be an individual subject—in this case each 
subject is observed under all treatment combinations assigned to a given 
block. The block may be a group of three subjects—in this case each 


Table 8.9-2 Analysis of Design in Table 8.9-1 


Source df E(MS) 


Between blocks 
Replications 
AB (from rep 1) 
AB? (from rep 2) 
Within blocks 
A 
B 
AxB 
AB (from rep 2) 2 o; + (Deis, 
AB* (from rep 1) 2 о + (4)(2)o2,, 
Residual 4 о? 
А X rep 2 
B x rep 2 


a Er LLL a АН ы ШЫПЫ АЖ ШИА 


Subject is observed under the three treatment combinations assigned to the 
block. Тһе block may be a group of n subjects—in this case the n subjects 
may be observed under all treatment combinations within a given block. 
The analysis of this latter design takes the form given in Table 8.9-3. 

The expected values of the mean Squares have the same general form as 
those given in Table 8.9-2. The residual again provides an estimate of the 
within-block experimental error. The 12n — 8 degrees of freedom for the 
residual is the pooled interaction of the main effects with the replications and 
the interaction of the main effects with the subjects within groups. The 
breakdown of these degrees of freedom is as follows: 


lu 


N ыы 


Pnn] 
З 
+ 
е 
S 
RM 


Residual 12n — 8 
A x rep 2 
B x rep 2 


A X subject within group 12n — 6 
B x subject within group 12n — 6 
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A formal algebraic proof will now be outlined to demonstrate that the AB 
component in the design in Table 8.9-1 is confounded with block effects 
within replication 1 but is free of such confounding within replication 2. 
This proof assumes that an observation under treatment combination ab;; 
in block k provides an estimate (disregarding the experimental error) of the 
sum of the parameters on the right-hand side of the following expression, 


Хк = p F o4 F Bs + Qa + ть 


where the symbol 7, designates the effect of block k. Assuming that factors 
А and Bare fixed, Хо, = 0, Ef; = 0, Хаф)у = 0, and оар); = 0. These 


i 1 
restrictions оп the above model follow directly from the basic definition of 
treatment effects. Interactions with block effects do not appear on the 


Table 8.9-3 Analysis of 3 x 3 Factorial Experiment in Blocks of 
Size 6 (Repeated Measurements) 


Source df 
Between subjects бл —1 
Groups 5 
Replications 1 
AB (from rep 1) 2 
AB? (from rep 2) 2 


Subjects within groups 


Within subjects 12n 
A 2 
B 2 
AB (from rep 2) 2 
AB? (f ер1 
(зое 12n — 8 


Residual 


right-hand side of the model, hence the implicit assumption that such effects 
either do not exist or that such effects are negligible relative to the magnitude 


of the other effects. . 7 е қ 5 
The set of treatment combinations which satisfy the relation x, + Ха =! 
define the set J, It will be convenient to define the sum of the interaction 


effects іп the set Jy by the symbol 3(#f),,- Thus 
Жару, = (а м + (Pis + (а. 

Similarly, by definition, 
3(08) 2, = (afp) + (а о + (%B Joos 
3(«B) 7, = (ap) + (98). + (о). 


In terms of the basic model and the specialized definitions, for observations 
on treatment combinations in the set їп replication 1 of the design in 
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Table 8.9-1, 
Хө = и + о + By + (0) + m 
Хаа == и + + б, + (08), 4- т 
Xn == и + a + By + (88) + т 
Jo, = 3u + 3(«8)у„ + 37 
The symbol J, is used to designate the sum of all observations on treatment 
combinations in the set Jo. Similarly, 
Jy = Зи + 3(@B) 7, + 379, 
Ja, = 3и + (о), + 3m. 
The sum of squares for the AB component of A x Bis obtained from dif- 
ferences between these J totals. For example, 
Jg ЛЕ 3(25);, 3(xB);, + 3m, — 3779. 
Each of the other possible comparisons among the J’s also includes block 
effects. Hence, from the data in replication 1, 55; is completely con- 


founded with differences among blocks. 
In contrast, for replication 2, 


Xoo = и + % + Во + (%B)oq + т, 
X= Br o, + В, + (о), + 7% 
Xn == p + в + B, + (08), + т 
Jo, = 3и 4 Хар), + т + ть + mg 
Similarly, for replication 2, 
Л. = Зи + ар), + т Ts TiTe 
Ja, = 3p + ой), + та + ть + т. 


Differences between J totals in replication 2 are free of block effects. For 
example, 


о ess 3(%B),, = Хор). 
Hence within-block information on the АВ components of A x B may be 
obtained from replication 2. 
To show that replication 1 provides within-block information on the 
AB? (or Г) component of 4 x B, 
Хо == y + в + By + («Ву + т 
Хуү== ut % + By + (2B) + Ng 
Xo =u + аҙ + By + (98), + gr 
Io, = Зи + 3(2B);, TE + my т 
Similarly, for replication i5 
i= 3u + 3(xp);, TT + ту + тз, 
1 = 3u + Хаф), + m + ть + лу. 
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Since each / total contains the same block effects, differences between 
these totals will be free of block effects. Hence information on the Z com- 
ponent of A x B obtained from replication 1 will not be confounded with 
block effects. 

For the general case of a design having r replications (one observation per 
cell) of the design in Table 8.9-1, with r/2 replications of the form of 
replication 1 and r/2 of the form of replication 2, the sum of squares for the 
J component of A x Bis 
ол Quy 
3(r[2) 9/2) ° 


where J totals are restricted to the replications in which the J component of 
A x Bis free of block effects, Similarly, the sum of squares for the / com- 
ponents of A x Bis given by 


SS4pQ) = 


ss’ ED HOA 
480 350) 9c 
where the / totals are restricted to replications in which the / component of 
A x Bis free of block effects. ) 

If the symbol AB,, represents the sum of all observations on treatment 
combination а, then this sum will not be free of block effects, since all 
treatment combinations do not appear in each of the blocks. An adjusted 
sum which is free of block effects is given by 
A; +B; , Јак). өз). б 

3 m 2 3 


АВ), 


where A, = sum of all observations at level a;, 
B, = sum of all observations at level b;, "t г : 
«+. = sum of all observations on treatment combinations which satisfy 


the relation x, + хә = i + j (mod 3) in replications in which 
differences between J's are free of block effects, А à 
Lisz. = sum of all observations on treatment combinations which 
satisfy the relation x, + 2x2 = i + 2j (mod 3) in replications 
in which differences between /'s are free of block effects. 
For example, the adjusted total for all observations made under treatment 
combination abos is 


In terms of the general linear model, this last. expression estimates the pa- 
rameters on the right-hand side of the following expression: 
АВ = ги + ras + rfa + 108), + Қаф), 
sum of all block effects 
3 
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Since all adjusted totals will contain this last term, differences between the 
adjusted totals will be free of block effects. To demonstrate that АВ), 
actually does estimate the parameters on the right-hand side, 


da ru + ға + 3(sum of all block effects) 


2. ти + rf, + 4(sum of all block effects) 


^з = ги + қаЙ),, + 3(sum of blocks in which J is free of block effects) 
2 


1. 
T == ги + r(%P);, + 3(sum of blocks in which I is free of block effects) 
p 

-- = —3ru — (sum of all block effects) 


ABos = ru + ға + rf, + "ай, (В), 


‚ Sum of all block effects 


In making comparisons among the means of treatment combinations, 
One uses the adjusted totals to obtain the means. The effective number of 
observations in each of these adjusted means is less than г, since information 
on the Гапа J components of the 4 X B interaction is obtained from only 
one-half of the replications. For most practical purposes, however, the 
adjusted cell means may be handled as if they were based upon r replica- 
tions. The adjusted mean for cell ab, is 


АВ), 
A 


If this design had r replications with n observations per cell, 48; would 
have the definition given above but 


АВ, т 


8.10 Numerical Example of 3 x 3 Factorial Experiment in Blocks 
of Size 3 


_ In the experiment to be described it was desired to have within-subject 
information on all factorial effects ina 3 x 3 factorial experiment. How- 
ever, it was not experimentally feasible to have each subject observed under 
each of the nine treatment combinations. Each person could be observed 
under three treatment combinations. The design outlined in Table 8.9-1 
was selected for use, 

The experimenter wished to have two observations under each treatment 
combination. A random sample of 12 subjects was obtained from a speck: 
fied population; the sample was divided at random into six groups of 2 
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subjects each. The groups were then assigned at random to the blocks in 
Table 8.10-1. The 2 subjects within each group were observed under each 
of the three treatment combinations in the assigned block. The order in 
which a subject was observed under a treatment condition was randomized 
foreachsubject. The data obtained from this experiment are summarized 
in Table 8.10-1. 

Table 8.10-1 Summary of Observed Data 


Block | Person Total 
(00) (12) (21) 
1 1 714 7 15 36 — P, 
2 6 3 5 14 =P, 50 = б, 
(01) (10) (22) 
2 3 АСЫРЫ T 10 17 =P; 
2 7 6 30 43 — P, 60 = Gy 
(02) (11) (20) 
3 5 E 15 7 27 = P; 
6 5 5 3 13=P, 40=G; 
i 150 — R, 
(00) (11) Q2 
4 7 710 10 15 35 =P, 
8 15 20 25 60 = P, 95 = б, 
(02) (10) Q1) 
5 9 Sarge 12 20 = P, 
10 7 10 18 35 =Po 55-0; 
Т ой a2 00 
11 e e УЙ. Л) 18 = Py 
$ 12 4 3 5 12-Р; 30-6, 
180 = Ry 
Бйз оор трнд рну ы уз = эз. 


In the row having person 1 at the left, the entries 14, 7, and 15 are the 
respective observations (with order randomized) on person 1 under treat- 
ment combinations (00), (12), and (21). The sum of these three observa- 
tions on person 1 is denoted by the symbol P;. The symbols G; denote the 
totals for the six observations made within a group (block). The symbols 
R, denote the totals of the 18 observations made within a single replication. 
Additional data required for the computation of the sums of squares in the 
analysis are given in Table 8.10-2. | : 

The sums of squares for all effects except the A x Binteraction аге com- 
puted in the same manner as that used in any factorial experiment having 
Iepeated measures. Data required for computation of all effects except 
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the components of A x Bare summarized in parts ii and iii of Table 8. 10-2, 
From the data in part iii, the between-subject sum of Squares is given by 
PRG. 
3 36 
The within-subject sum of Squares is given by 


3815.33 — 3025.00 = 790,33, 


rxx? = 4494 — 3815.33 = 678.67. 


Data required for the sums of Squares for groups, replications, and the 
main effects of factors A and B are also given in part iii. 


Table 8.10-2 Summary Data Required for Analysis 
Replication 1 
ж A ps 


Replication 2 


B by 5 
4, 20. 10 10 4) 1297-19 10 
Se 229 
@ а 10 7207 10 а 15 30 10 
Е А > 
ir the к 40- — -80 = h, Jz, = 50—а,——10: 307 40---95 
0a — 20° а, -~40 = 1, Jo, = 65—-a,— 25^ 10 10-55 
yes EC M 
.$0——«,——10^ 20 ^10-..39— h, ti, = 65—a,—15~ 30 `10-- -30 
150 150 180 180 
Combined replications 
b, b, b, Total 
| 45 120. 180%, %- 
Cii) 


90 120 120 330=G 


в B в, 


(ZP*)3 = 3815.33 
(iii) ZIX? = 4494 (®б®)/3л = 3441.67 
(24*)/3nr = 3229.17 (ŒR?)/9n = 3050.00 

(2B?)/3nr = 3075.00 


From the data in part i, both the between- 
ponents of the A x B interaction may 
the right of replication 1, the within: 
ponent of A x Bis given by 


xm ©)? 

3n 9n 

_ 80? + 40? + 302 (150)? 
3(2) 18 


SS p) = SS,5: = 


= 233.33. 
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From the totals at the left of replication 2, the within-subject information 
on the J component of A x Bis given by 


nin IE (ӘЛЕ 
SS 4л) = SS45 = een e 
2 21 2 2 
50 2 +65 ав) et 


Thus within-subject information on the components of the A x B inter- 
action is obtained separately from each of the replications—one component 
from replication 1, the other from replication 2. 


Table 8.10-3 Summary of Analysis 


ЕЕ ice i ae gor ims 
Source SS df MS F 
Between subjects 790.33 11 
Groups (blocks) 416.67 5 
Replications 25.00 1 
AB (from rep 1) 33.33 2 
АВ? (from rep 2) 358.33 2 
Subjects within groups 373.66 6 62.28 
Within subjects 678.67 24 
A 204.17 2 102.08 9.82** 
B 50.00 2 25.00 2.41 
A x B (adjusted) 258.33 4 64.58 622% 
АВ (from rep 2) 25.00 2 
АВ? (from гер 1)| 233.33 2 
Residual 166.17 16 10.39 


жж Е,,(2,16) = 6.23; Ғо(4,16) = 4.77 


The between-subject information on the J component of A x B is 
obtained from the totals at the left of replication 1. 


402 + 50° + 608 (150) 
SS 4л) = SSaz = 6 18 33.33. 


The between-subject information on the Icomponent of A x Bis obtained 


from the totals to the right of replication 2. 


952 + 552 4- 30? (1807 | 
85 4р = SS4p* 7 т 358.33. 


n Table 8.10-3 is obtained from the relation 


The residual sum of squares i 
85, = SS, subj — $84 — SSB SS,,(within) — SS, j(within). 


The residual sum of squares may also be computed directly from the inter- 
actions of factor A and B with the replications and subjects within groups. 
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Unless there is an a priori reason for handling the components of the 
A x Binteraction separately, a single test is made on the combined within- 
subject components of this interaction. (In cases in which the large cell 
values in a two-way summary table fall along the diagonal running from 
upper left to lower right, the AB? component will be large relative to the AB 
component.) In this numerical example the А x B interaction is statisti- 
cally significant. The simple effects for both factor A and factor B are 
between-subject effects; however, approximations to tests on simple effects 
may be obtained by working with the adjusted cell totals. Differences 
among the latter totals may be considered to be within-subject effects. 

To illustrate the computation of the adjusted cell totals, for treatment 
combination аб; 


" Ay + В, I, + J. G 
ABI ае е йш Mie анаан 
Б Е 3 
8 12 
% E 9,403.50 330 uem 
% 


(Note that in part i of Table 8.10-2 cell abo contributes to the Л. total in 
replication 1 and to the J}. totalin replication 2.) Тһе unadjusted cell total 
is 20. As another example, the adjusted total for cell ab,, has the form 


Comparisons among the adjusted cell totals use the within-subject residual 
mean square as an error term. For approximate tests, the number of 
effective observations in each cell is considered to be nr — 4. 

To illustrate the procedures for tests on adjusted cell totals, 
(АВ — АВ)? 

2nr MSres 


Тһе adjusted total 48) was found to be 18.33. Тһе adjusted total АВ 
is given by 


F= 


(Note that aby. contributes to the 7, total in replication 1 and to (һе, total 
in replication 2.) 


fl 150 2 
АВ = TUS 565. 340 
3 2 3 
= 76.67. 
Thus, r= (18.33 — 76.67) =40 
2(2)(2)(10.39) qd 


The critical value for a .05-level (a priori) test is ЕЁ (1,16) = 4.49. 
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8.11 Designs for 3 x 3 x 3 Factorial Experiments 


The 27 treatment combinations ina3 x 3 x 3 factorial experiment may 
be divided into sets in several different ways and balance still be maintained 
with respect to main effects and two-factor interactions. Each of the 
following components of the A x B x C interactions can serve to define 
such balanced sets: 


df 

AxBxC 8 
ABC 2 
АВС? 2 
АВ?С 2 
АВ?С? 2 


For example, the sets formed by using ће ABC component are as follows 


Xi + xa + x, = 0 (mod 3) ху + Xa + % = 1 (mod 3) x, +з + % = 2 (mod 3) 


(000) (001) (002) 
(012) (010) (011) 
(021) (022) (020) 
(102) (100) (101) 
(111) (112) (110) 
(120) (121) (122) 
(201) (202) (200) 
(210) (211) (212) 
(222) (220) (221) 


A design consisting of these three blocks would provide within-block 
information on all effects except the ABC component of the three-factor 
interaction. With only a single replication (assuming onlyone observation 
per cell) no estimate of experimental error is available. 

If the experiment were to be replicated by using the same block structure 
as that given above, the design would lack balance with respect to the com- 
ponents of the three-factor interaction. That is, within-block information 
would be available on three of the four components of this interaction, but 
no within-block information would be available on the ABC component. 
A design which has balance with respect to the components of the three- 
factor interaction may be constructed by means of the scheme outlined in 
Table 8.11-1. A minimum of four replications is required for balance. 
The blocks within each of the replications are constructed by means of 
relations associated with a component of the three-factor interaction; each 
of the replications involves a different component. Hence within-block 
information on all of the three-factor components is available from some 
three of the four replications. ” (44 КГУ. 

The structure of the blocks defined by the design in Table 8.11-1 is given 
in Table 8.11-2. Blocks in replication 1 are balanced with respect to all 
main effects and all two-factor interactions. These blocks are also balanced 
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Table 8.11-1 Construction of Blocks of Size 9 


Baer Component H : 
Replication | Block eaten ded Defining relation (mod 3) 
1 1 (ABC), XQ Pity itty = 0 
2 (ABC), ха +x =1 
3 (ABC), Ху +X + x3 =2 
2 4 (ABC?), xy XQ + 2x, =0 
5 (ABC), ху + хә + 2х3 = 1 
6 (АВС?), Xp xX, + 2x5 = 2 
3 7 (AB*C), Xj + 2x, + xy =0 
8 (AB*C), ху + 2x, + хз = 1 
9 (АВС), хі + 2х3 +x, = 2 
4 10 (АВС?) Xp + 2x, + 2x4 =0 
1 (AB*C?), Xpbt2x,42x,—1 
12 (AB*C?), XpT2x,42x,—2 


Table 8.11-2 Design Corresponding to Table 8.11-1 


Replication 1 Replication 2 
Block 1 Block 2 Block 3 Block 4 Block 5 Block 6 
(000) (001) (002) (000) (002) (001) 
(012) (010) (011) (011) (010) (012) 
(021) (022) (020) (022) (021) (020) 
(102) (100) (101) (101) (100) (102) 
(111) (112) (110) (112) (111) (110) 
(120) (121) (122) (120) (122) (121) 
(201) (202) (200) (202) (201) (200) 
(210) Q11) (212) (210) (212) (211) 
(222) (220) (221) (221) (220) (222) 
ABC ABC? 
Replication 3 Replication 4 
Block 7 Block 8 Block 9 Block 10 Block 11 Block 12 
(000) (001) (002) (000) (002) (001) 
(011) (012) (010) (012) (011) (010) 
(022) (020) (021) (021) (020) (022) 
(102) (100) (101) (101) (100) (102) 
(110) (111) (112) (110) (112) (111) 
(121) (122) (120) (122) (121) (120) 
(201) (202) (200) (202) (201) (200) 
(212) (210) (211) (211) (210) (212) 
(220) (221) (222) (220) (222) (221) 


AB’C ABC? 
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with respect to all components of the three-factor interaction except the 
ABC component. To illustrate the balance with respect to the АВС? com- 
ponent within block 1, the nine treatment combinations in this block may be 
divided into the following sets: 


(АВС?), (ABC?) (ABC?) 
(000) (021) (012) 
(120) (111) (102) 
(210) (201) (222) 


Differences between the totals for these sets form part of the АВС? com- 


ponent. 
Table 8.11-3 Analysis of Design Having Block Size 9 
(One Observation per Cell) 


Source of variation df E(MS) 
Between blocks п 
Replications 3 
Blocks within reps 8 
ABC (from rep 1) 2 
ABC? (from rep 2) 2 
АВ?С (from гер 3) 2 
AB'C* (тот rep 4) 2 
Within blocks 96 
we 2 o? + 3602 
B 2 оў + 3605 
с 2 o? + 3602 
AxB 4 оў + 12055 
AxC 4 о? + 120%, 
BxC 4 оё + 120}, 
(AxBxCy Ф 8 оё + Gea, 
АВС (omit rep 1) 2 
ABC? (omit rep 2) 2 
ABC (omit rep 2 > 
АВС omit ге) 
Residual ^ 2 70 в 
Main effects x reps 18 
2-factor int x reps 36 
3-factor int x reps 16 


ation per cell, the analysis of this design has the form 
given in Table 8.11-3. The computation of the sums of squares for main 
effects and two-factor interactions follows the same rules as those of the 
replicated factorial experiment. In computing the within-block compo- 
nents of A x B x C, data from replications in which a component is con- 
founded with block effects are not used. For example, to compute the 


Assuming one observ: 


426 FACTORIAL EXPERIMENTS WITH SOME INTERACTIONS CONFOUNDED 


within-block component of ABC”, data from all replications except replica- 
tion 2 are combined to form a three-way summary table. From this sum- 
mary table one obtains the totals (48C?);, (ABC*);, and (ABC?);, where 
(АВС?); represents the sum of observations in all cells which satisfy the rela- 
tion x, + X + 2x = i (mod 3). Then 


Хавс)?  [XABCYD 
27 81 ^ 


55 nc: 


Asa partial check on the numerical work, X(4 BC?) is equal to the sum of all 
observations in replications 1, 3, and 4. 

The within-cell residual is most conveniently obtained by subtracting the 
within-cell treatment effects from the total within-cell variation. The 
sources of variation which are pooled to form the residual term are shown 
in Table 8.11-3. These sources may be computed separately. In this 
design there are only three effective replications on each of the components 
of the three-factor interaction; thus the ABC x replications interaction, for 
example, has four degrees of freedom. Hence the three-factor x replica- 
tions interaction has a total of 16 degrees of freedom—4 degrees of freedom 
for each of the four components of the three-factor interaction. 

In making tests on differences between adjusted cell means, adjusted cell 
totals are required. Such totals are given by 


ABC; = $[A; + B; + С, 4 (АВ), .; + (АВ?) 4; + (АС): +ь 
+ (АС 2] + @([(АВС) ++» + (АВС); e 
25 (AB’C) i495 4% 38 (АВС?) s; s] — 46. 


For example, the adjusted cell total for the cell арс, is 


АВС» = $[ А + By + С, + (AB), + (АВ?)„ + (АС), + (АС?)у] 
+ (АВС), + (АВС?), + (АВС); + (АВСД — $G. 


The symbol (4B*C?), designates the sum of treatment combinations which 
satisfy the relation x, + 2x, + 2x, = 0 in replications in which this com- 
ponent is not confounded with blocks. This total is obtained in the course 
of computing SS! взт. 

Since the two-factor interactions in the design given in Table 8.11-2 are 
not confounded with block effects, no adjustments are required for cell 
totals in two-way summary tables. Differences between means computed 
from such totals will be free of block effects. 

Block Size = 3. Іп dividing the 27 treatment combinations into blocks 
of size 9, a relation of the general form 


и = X, + Ux, + u3x3 = i (mod 3) 
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was used for each of the replications. The relation и corresponded to some 
component of the three-factor interaction. In the analysis of the resulting 
design, the и component was confounded with block effects within the repli- 
cation having blocks defined by the wrelation. The 27 treatment combina- 
tionsina3 x 3 x 3 factorial experiment may be divided into blocks of size 
3 by requiring that the treatment combinations within a given block simul- 
taneously satisfy the two relations 


и = i (mod 3), 
v — j (mod 3), 


where v is some relation other than и. \ 
In the analysis of a replication having blocks formed in this way, the com- 
ponents corresponding to и and v will both be confounded with differences 


between blocks. In addition the components corresponding to the rela- 
tions и + v and u + 2v will also be confounded with block effects. Тһе 


latter components are known as the generalized interactions, or aliases, of 
the components corresponding to u and v. То illustrate, suppose that the 
blocks within a replication are defined by the relations 


и = ху + Xa + 2x3 = i (mod 3), 
v = X, + 2х5 = j (mod 3). 


These relations correspond, respectively, to the ABC* and the AB? com- 
ponents. One of the generalized interactions, or aliases, is given by 


u + v = 2x, + 3x, + 2x, = i + j (mod 3) 
= 2x, + 2x3 = i + j (mod 3) 
= x, + x = 2(i + j) (mod 3). 


(The last line is obtained from the one above by multiplying both sides of the 
equation by 2 and then reducing coefficients to the modulus 3.) à This last 
relation corresponds to the AC component. The second generalized inter- 


action of u and v is given by 
u +20 = 3x, + 5x44 2х3 = i + 2j (mod 3) 


= 2x, + 2x, = i + 2j (mod 3) 
= xa + xy = 2(i + 2j) (mod 3). 


This last relation corresponds to the BC component. ; 
То summarize this illustrative example, blocks are defined by relations 


corresponding to the ABC? and AB? components. The generalized inter- 
actions, or aliases, of these components are the AC and BC components. 
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Hence, in a replication defined by these relations, the analysis would have 
the following form: 


Between blocks 8 
AB? 2 
AC 2 
BC 2 
ABC? 2 
Within blocks 18 
A BG 2 each 
AB, AC?, BC? 2 each 
ABC, АВ?С, АВ?С? 2 each 


The defining relations in this example may be symbolized by (A4 BC?), and 
(ВС?),; the aliases may be symbolized by (АС) and (ВС) оу. The 
block defined by (4BC?), and (BC?), is 

(022), (101), (210). 
It may be verified that each of the treatment combinations in the blocks also 
satisfies the relations (АС), оу = (АС), and (ВС), = (BC). 

In general, if two relations are used simultaneously to define blocks, one 
needs to know the generalized interactions of the components corresponding 
to these relations in order to carry out the analysis. There is a relatively 
simple rule for determining the aliases of any two components. Given two 
components having the general form W апа X, their aliases will have the 
general forms WX and И/Х?. To illustrate the use of this rule, given 


W = ABC? and HESAB 
one alias has the form 
WX = (АВС?)(АВ?) = A®B°C? = А?С? = AC, 
upon reduction of the exponents to the modulus 3. The second alias has 
the form 
WX? = (ABC'(AB* = (ABC?)(A?B!) = A®B°C* = B°C? = BC. 
It is immaterial which of the components is designated W and which is 


designated X, since the aliases are also given by the general forms W°X and 
WX. As another example, the aliases of the components АВ and AC are 


WX = (AB)(AC) = АВС = АВ?С?, 
WX? = (АВАС)? = ABC = BC*. 
To construct a design having within-block information on all components 
of the interactions, and yet provide complete information on all main effects, 


the scheme presented in Table 8.11-4 may be used. The aliases associated 
with the pairs of defining relations are indicated. The design represented 
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Table 8.11-4 Construction of Blocks of Size 3 


Replication | Defining relations Aliases 


1 ABC, АВ? AC?, BC? 
2 ABC?, АВ? AC, BC 
3 AB*C, AB AC?, BC 
4 AB*C?, AB AC, BC? 


by this scheme will provide some within-block information with respect to 
all interaction components. For example, the AB? component is con- 
founded with block effects in replications 1 and 2 but is free of block effects 
in replications 3 and 4. Hence the relative within-block information on 
this component is 1. The AC? component is confounded in replications 1 
and 3 but is not confounded in blocks 2 and 4. It will be found that all the 
components of two-factor interactions are confounded with blocks in two 
of the four replications but are free of such confounding in two of the four 
replications. Each of the components of the three-factor interaction is 
confounded with block effects in one of the replications but is free of block 
effects in three of the four replications. Hence the relative information on 
the components of the three-factor interaction is }. 

The blocks corresponding to the design outlined in Table 8.11-4 are 
obtained by subdividing the blocks in Table 8.11-2. Block 1 in the latter 
table is subdivided as follows: 


(AB?), (AB) (AB*), 
70000). (021) (012) 
(111) (102) (120) 
(222) (210) (201) 


Each of the treatment combinations within a block satisfies the relation 
(ABC), as well as the relation heading the block. Block 2 in Table 


8.11-2 is subdivided as follows: 


(AB), (АВ?), (АВ?), 
001) 022) (010) 
(112) (100) (121) 
(220) (211) (202) 
Block 3 in Table 8.11-2 is subdivided as follows: 
(АВ?) , (АВ?), (АВ?), 
(002). (020) (011) 
(110) (101) (122) 
(221) (212) (200) 


The nine blocks of size 3 that have just been constructed form replication 1 
of the design in Table 8.11-4. қ p 

The over-all analysis of the latter design takes the form given in Table 
8.11-5. Within-block information on the main effects is complete; hence 
no special computational procedures are required. In computing the АВ 
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component, only information from blocks in which this component is not 
confounded is used. Blocks of this kind are in replications | and 2; the 
blocks in replications 3 and 4 do not provide within-block information on 
AB. All within-block information on AB? is obtained from replications 
3and4. Hence the sum of squares for AB? is computed from a summary 
table prepared from replications 3 and 4. Given this summary table, 


(АВ?) + (AB)? + (AB)? — [X(ABy? 
18 Sa 


where (AB); is the sum of observations on all treatment combinations in 
replications 3 and 4 which satisfy the relation x, + 2x, =i (mod 3). Asa 
partial check on the numerical work X(AB2); is equal to the sum of all 
observations in replications 3 and 4. 

Other components of the two-factor interactions are computed in an 
analogous manner. The within-block information on AC is obtained from 
replications 1 and 3; similar information on AC? is obtained from replica- 
tions 2 and 4. 


SS 


Table 8.11-5 Analysis of Design in Table 8.11-4 


Source of variation df 
Between blocks 35 
Replications 3 
Blocks within reps 32 
Within blocks 72 
Main effects: 4, B, C (6 — (each) 


Two-factor int: 
AB, АВ? (4 information) 4 (2 each) 
АС, AC? ($ information) 4  (2each) 
BC, ВС? (1 information) 4  (2each) 
Three-factor int: 
ABC, АВС?, АВ?С, AB*C? (3 information) 8 (2 each) 
Residual 46 


тсе ы mr EE 


Within-block information on the three-factor interaction components 15 
obtained from some three of the four replications. For example, informa- 
tion from replications 2, 3, and 4 is combined to obtain the ABC component. 
Given a summary table of observations from these three replications, 


55,5 = BOP + (ABO? + (ABO? + (ABO — [ECABO] 
27 81 
where (ABC); is the sum of observations in replications 2, 3, and 4 satisfying 
therelationx, + x, + x4 = i (mod 3). The other components of the three- 
factor interaction are computed in an analogous manner, the ABC? com- 
ponent, for example, being obtained from replications 1, 3, and 4. Since 
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components of both two-factor and three-factor interactions are partially 
confounded with block effects, estimates of cell means require adjustment 
for block effects. Тһе latter adjustment is most readily made by adjusting 
celltotals. Adjusted cell totals are given by 


АВС), = %(4; + В, + С) 
+ SB, + (АВФ, + (АСУ, + (AC) gs + (ВС); 4 + (ВС); s] 
+ zs [(4BC)., 54% zi (АВС?); 54.0% an (AB°C)i зу, ar (ABC!) уэ] 
°С. 
The primes indicate that summations are restricted to replications in which 
а component is not confounded with block effects. The sums required for 
obtaining an adjusted cell total are obtained in the course of computing the 
within-block components of the interactions. In terms of the basic linear 
model, it may be shown that 


АВС = “Ги Б Ё; + ys + (x); + (ку) + (BY) sx 
+ («By); se] + 4 (sum of all block effects). 


The right-hand side of this last expression represents the parameters that 
would be estimated by a cell total if each block contained all treatment com- 
binations. Differences between two adjusted totals will be free of block 
effects, since the last term on the right-hand side is a constant for all cells. 
With the exception of the block effects, each of the parameters on the right- 
hand side is estimated independently of all others. Estimates of the pa- 
rameters associated with main effects are each based upon four effective 
replications. However, estimates of parameters for two-factor interactions 
are based upon two effective replications, and estimates of parameters for 
three-factor interactions are based upon three effective replications. In 
making comparisons among adjusted cell totals in the three-way table, for 
most practical purposes one may consider each of these adjusted cell totals 
as being based upon four replications. Exact methods for taking into 
account the difference in the effective number of replications for the separate 
parts of the adjusted cell totals are, however, available [cf. Federer (1955, 
chap. 9)]. 

An adjusted total for a cell in a two-way summary table, the cell ab,,, for 
example, has the form 


ABj, = KA, + В) + (АВ), + (АВ? +] — 16. 
Similarly an adjusted total for the cell ас, has the form 
ACh = KA; + Cy) + АС), + (AC) ы] — 16. 


For comparisons among adjusted totals in a two-way summary table, 
the effective number of observations in a total varies as a function 
of the nature of the comparison. The difference between adjusted totals 


| 
а 
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which are in the same row or column of a two-way summary table is based 
upon (2)(12) effective observations, whereas the difference between adjusted 
totals which are not in the same row or column is based upon (3)(12) effective 
observations. Comparisons involving mixtures of these two kinds of dif- 
ferences have effective numbers of observations which may be determined 
by the general expression for the variance of a linear combination of inde- 
pendent terms. 


Table 8.11-6 Analysis of Design in Table 8.11-4 with 
Repeated Measures 


Source of variation df 


Between subjects 36n — 1 
Groups 35 
Subjects within groups 36(n — 1) 

Within subjects 72n 
Main effects: 4, B, C 6 (2 each) 
Two-factor int: 

AB, АВ? (3 information) 

AC, АС? (1 information) 

BC, BC? (& information) 
Three-factor int: 

ABC, АВС?, AB*C, АВС? (1 information) 8 (2 each) 
Residual 72п — 26 


SSS SE See 


So far the design outlined in Table 8.1 1-4 has been considered for the case 
in which there is one observation per cell. Several variations on this basic 
design are possible. One variation might be to have the block represent à 
group ofn subjects. Each ofthe groups would be assigned at random to the 
blocks, and each of the n subjects within a group would be observed, in a 
random order, under each of the treatment combinations within a given 
block. The analysis of the resulting design would have the form shown in 
Table 8.11-6. Thus, with only three observations per subject, within-sub- 
Ject estimates of all factorial effects are available. 

To summarize the principles underlying the construction of designs for 
3 x 3 x 3 factorial experiments, blocks of size 9 may be constructed by 
utilizing relations Corresponding to three-factor interactions. If this is 
done, components of three-factor interactions are either completely of 
partially confounded with block effects. To construct blocks of size 3, 
the blocks of size 9 are subdivided by means of relations corresponding to 
two-factor interactions. In this case, in addition to the interaction corte- 
sponding to the defining relations, the aliases also become confounded with 
block effects. 

The principles developed in this section may be extended to include fac- 
torial experiments of the form P X р X p, where pis a prime number. Use 


(2 each) 
(2 each) 
(2 each) 


ыш 
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of components of the three-factor interaction reduces the block size to p?. 
Then blocks of size р? are reduced to size p through use of relations corre- 
sponding to the components of two-factor interactions. For example in a 
5 x 5 x 5 factorial experiment, the relations и and v may be used to obtain 
blocks of size 5. In addition to components corresponding to the relations 
и and v, components corresponding to the following relations will also be 
confounded with block effects: 


и +0, и + 20, и + 3v, и + 40. 


For a single replication with block size 5, the number of blocks required 
would be 25. Hence the degrees of freedom for blocks within a single 
replication would be 24. Within a single replication, six components of 
interactions are confounded with block effects; each component has 4 
degrees of freedom. Hence 24 degrees of freedom corresponding to the six 
components of interactions are confounded with differences between 
blocks, the latter differences also having 24 degrees of freedom. 

In terms of the multiplicative scheme, the aliases of the components W 
and Хіпа5 x 5 x 5 factorial experiment have the general form 


Wx, WX’, wx, Wx*. 


To make the notation system unique, the exponent of the first letter in an 
interaction term is made unity by raising the whole term to an appropriate 
power and then reducing exponents modulo 5. It is not difficult to show 
that WX* and (WX*)", where n is any number modulo 5, correspond to 
equivalent relations. For example, the following relations are equivalent 
in the sense that they define the same set of treatment combinations: 


АС: xy + 2x3 =i (mod 5), 
A*C* 2х1 + 4x3 = 2i (mod 5). 


812 Balanced 3 x 2 x 2 Factorial Experiment in Blocks of Size 6 


A balanced design for 3 x 2 x 2 factorial experiments in which the block 
size is 6 can be constructed at the cost of partial confounding of the BC and 
ABCinteractions. A minimum of three replications is required for balance; 
in this balanced design the relative information on BC is % and the relative 
information on АВС іѕ $. Тһе balanced design is given in schematic form 
in Table 8.12-1. In this table the symbol Kọ designates the set of treatment 
combinations satisfying the relation хз + з= 0 (mod 2), and the symbol 
K, designates the set of treatment combinations satisfying the relation 
X» + xa = 1 (mod 2). In terms of the symbols for individual treatment 
combinations, the design is given in Table 8.12-2. The spacings within the 
blocks designate treatment combinations belonging to different K sets. 

The computational procedures for all factorial effects, except the BC and 
ABC interactions, are identical to those for a factorial experiment in which 
there is no confounding. The BCand ABC interactions require adjustment 
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for block effects. A simplified procedure for computing the BCinteraction 
will be outlined for the case in which there is no confounding. The adjust- 
ments required for the partial confounding with block effects will then be 
obtained. 


Table 8.12-1 
Rep | Rep 2 | Rep 3 
Level of А - 
Block 1 Block 2 | Block 3 Block 4 | Block 5 Block 6 
а Ky K, K, Ky К, Ky 
а, К, Ky Ko K, К, Ko 
az Kı Ko Kı Ko Ko K, 
Table 8.12-2 
Rep 1 Rep 2 Rep 3 


Block 1 Block2 | Block3  Block4 | Block5 Block 6 


(000) (001) (001) (000) (001) (000) 
(011) (010) (010) (011) (010) (011) 
(101) (100) (100) (101) (101) (100) 
(110) (111) (111) (110) (110) (111) 
(201) (200) (201) (200) (200) (201) 
(210) (211) (210) (211) (211) (210) 


—————— EE EE a 


Assuming that the BC interaction were not confounded with blocks, the 
total of all observations in the set Ko, which will be designated by the symbol 
Ko, may be shown to be an estimate of the following parameters of the linear 
model (assuming one observation per cell): 

Ky, = 18u + 18(By)x,. 
The symbol (Ву) к, denotes the K, component of the BC interaction. 
Similarly, the total of all observations in set Ку, designated by the sy mbol 
Ki, may be shown to estimate 
Ky, = 184 + 18(By) к. 
Hence the difference between these two totals provides an estimate of 
Ko. — Кү = 18(By) xc, — 18(8у)к.. 
Since the BC interaction has only these two components, 
(By), + (Ву), = 0, апа hence (By)g, = —(Ву)к,· 


(The sum of all components of an interaction term in the general linear 
model is assumed to be Zero, since these terms are measured in deviation 
units about the grand mean.) 
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Suppose that the symbol (fy) is used to denote (бук, Then 
—(#у) = (бук, 


Ko, — Кү = 18(By) — [—18(By)] = 36(By). 


Thus, if there were no confounding with blocks, the difference between the 
sum of all observations in set Ky and all observations in set K, estimates a 
parameter which is equal in absolute value to the difference between the K 
components of the BC interaction. Since the BC interaction has only one 
degree of freedom, there is only one independent parameter associated with 
it. In terms of the parameters of the model, disregarding the experimental 
error, the sum of squares for the BC interaction would be 


SSpc = 18(Ву)к, + 18(8y)c, = 36(8y)*. 
Thus a computational formula for the BC interaction, if there were no con- 
founding with blocks, would be 
(Ko. — Ky 
36 


In the design in Table 8.12-2, however, the BC interaction is partially 
confounded with block effects. The difference between the total of all 
Observations in the set Ky and all observations in the set K, actually estimates 
a mixture of (Ву) and block effects, i.e., 

Ky, — К, = 36(By) + 2(тә + 74 + Te — 7, — ту — ту), 
Where the z's represent block effects. If P, represents the total of all 
Observations in block i, 
ФР, + P, + P4 — P, — Pa — Ps) = 4(By) + 207 + ma + mg — m та — ту). 
The quantity on the left-hand side of this last equation may be used to adjust 
the difference Ky, — К, for block effects. The adjusted difference takes 
the form 


Hence 


55е = = 36(8)?. 


Q = Ko, — К, — (Р, + Pa + Pe — Р, — Ps — Pr) 


= 32(By). 
Thus the adjusted difference between the K totals provides 32 effective 
observations in the estimation of (fy); but if there were no confounding 
with blocks, the unadjusted difference between the K totals would provide 
36 effective observations. Hence the relative effective information on BC 
is $2, or $. The adjusted sum of squares for BC takes the form 


/ „0% 2 
5ӛс-% (By 


The problem of finding the appropriate adjustment for the ABC inter- 
action follows the same general pattern as that which has just been indicated 
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for the BC interaction. A simplified procedure for finding the ABC inter- 
action will first be outlined for the case in which there is no confounding. 
The symbol Ко; will be used to designate the subset of all treatment com- 
binations in the set K, which are at level a;; similarly the symbol K;; will be 
used to designate the subset of treatment combinations in the set K, which 
аге at levela;. The totals for all observations in the respective subsets will 
be denoted by the symbols К,, and Ky. If the symbol (xfy) K, denotes ап 
effect associated with the ABC interaction, 
Ky, — Ky, = 6(aBy)x,, = бо уу, + 12(8у). 
Since the sum of the components of an interaction at a fixed level of one of 
the factors is zero, 
(xBy)i,, + (@By)x,,= 0; ^ hence («Ву)к,, = —(«Ву)к„ 

If the symbol («fy), designates either (Ву) кь OF — (ау) x, assuming there 
were no confounding with blocks, Й 

Koi. — Кү, = (fiy), + 12(0у). 
Specifically, for each level of factor A, 

Koo, — Kyo, = 12(@By) + 12(By), 

Ko. — Ky, = (fly), + 12(By), 

Koz. — Ky, == 12(«Ву)» + 12(By). 
Thus, if no confounding were present, 

E(K,;, s Ky) 
12 

The sum of the first three terms on the right-hand side of the last equation 
defines 58, сіп terms of the parameters of the model; the last term defines 
855,0. (In both cases the error component has not been included.) Hence 

(Ко. жа Ky) 

12 
Thus, if there were no confounding, a computational formula for SS 5c 
would be given by 
SS en, қ X(Ky;. = Ki.) (Ko, [23 Ky 

e 12 Spies s 

In words, the ABC sum of squares is obtained by summing the BC inter- 


action at each level of factor А and then subtracting the over-all BC 
interaction. 


= 12(aBy)p + 1 (ару + 12(«Ву)# + 36(Ву)°. 


= SSazc + SSgo- 


SSunc mm ESSpco ata; — 55вс. я 
For factorial experiments іп which some of the factors are at two levels, this 


general computational procedure is simpler than direct calculation of the 
three-factor interaction, 
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In the design under consideration the difference Ky, — Kio is not free of 
block effects. For this design 
Ко — Kio = 12(«Ву) + 12(By) + 207 + 74 + 7 — Te — т; — m). 
The blocks in which K, appears at level a, have positive signs; the blocks in 
which К; appears at level a have negative signs. In obtaining the adjust- 
ment for block effects it is more convenient to work with the expression 
ХК- Kio) = 36(«Ву% + 36(By) + 6(m, + 74 + Tg — пә — T3 — ту). 
The adjustment for block effects requires the term 
(adj ay) = Py + P4 + Pg — Pa — Ps — Ру = 16(«Ву) + 485) 
+ бт, + та + Tg — Ta — тз — ту). 
The adjusted difference used to obtain the sum of squares for the three-factor 
interaction is 
3R, = (Ко, — Kyo) — (adj ao) — Q = 20(2Ву)ь, 
where О is the quantity used in the computation of 55%. Other adjust- 
ments for the levels of factor A are 
(adj ау) = Pa + Р, + Pe — Pi — Pa — Pos 
(adj a,) = Р, + Py + Ps — Pi — Ps — Pe: 
The adjusted differences used in the computation of SS нс are 
ЗА, = 3(Kn. — Ku) — (adi а) — 0 = 20(«Ву)› 
ЗК, = (Коз, — Къ) — (adj aa) — 0 = 20405). 
Ап adjusted difference of the form 3R, provides 20 effective observations 
in the estimation of the parameter (ару) whereas the corresponding un- 
adjusted difference provides 36 effective observations. Hence the relative 


information for the ABC interaction in this design is $4 = $. А computa- 
tional formula for the adjusted sum of squares for the ABC interaction 18 


given by 


2 
lnc = e = Cp) + («Ву + GP). 


If there were no confounding with blocks, 
59 (вс = Vo + («бу + Bye) 
ed. The ratio (%2)/12 = $, 


where the error component has been disregard ; 
all analysis of the balanced 


the relative information for ABC. The over- 
design in Table 8.12-2 is given in Table 8.12-3. In the underlying model, all 
factors are assumed to be fixed, blocks are assumed to be random, and the 
block x treatment interactions are assumed to be zero or negligible. Only 
under these stringent assumptions are the adjustments for block effects 
valid. In making F tests on all factorial effects, MS,,, forms the denomi- 
nator of all F ratios. In making comparisons between adjusted cell means, 


438 FACTORIAL EXPERIMENTS WITH SOME INTERACTIONS CONFOUNDED 
Table 8.12-3 


Source of variation SS df MS E(MS) 
Between blocks 5 
Replications 2 
Blocks within reps 3 
Within blocks 30 
A > o? + пфгоў 
B 1 оў + пргоў | 
с 1 9? + прдоў 
АВ 2 92 + nroz5 
AC 2 02 + пдо?,, 
(всу 1 оё + (проф, 
(ABC) 2 о? + (поз, 
Residual 19 o? 


the effective number of observations on the component parts should be 
taken into account. 


The adjustment for BG, is obtained by considering the parameters 
actually estimated by the total of all observations under treatment combina- 
tion bco, the levels of factor A being disregarded. There are nine such 
observations. 


ВС, = 9u + 9% + 9уо + (Ру) + (ту + лу + т) --2(т, + ma + ту). 


To obtain an adjustment for ВС, one must construct an expression which 
estimates the block effects on the right-hand side of this last equation. This 
expression is given by 


(adj ВС) = (XP, + P, + Ps) + GPs + P, + P) — (#8)2 — (06 
= (m + 7s +715) + 2(л, + та + т). 
The adjusted total for the cell beg is thus 
ВС» = ВС» — (adj ВС) = 9и + 90, + 9 + 9(BY)o0- 
The adjusted mean for cell bco is given by 


IB 
Bet A + Bo + Yo + (Ву): 


Since the treatment combination bco and bey, both belong to the set Ky, 
and since the design is symmetrical with respect to the parts of Ko, the adjust- 
ment for BC, is the same as that used for T em 
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By similar arguments and use of the relation that (Ву)к, = — (Ё) к, the 
adjustment for both ВС, and ВС, may be shown to be 
(adj ВСь) = (adj BCyo) = (00Р, + Ps + Ps) 

+ (00Р. + Р, + Po) + (33)0 — (96. 

In the arguments that have been used in the course of arriving at various 
adjustments, certain assumptions about restrictions on the parameters in 
the general linear model have been invoked. These restrictions are made 
explicit in Table 8.12-4. 


Table 8.12-4 
De a 
со сі 
bo b, bo b, bo by 
ay | («Во —GBr)o | -Cpr  (Br)o ө| (В) -(0» 
a (aby) —(«By), (әу), (аву), с | -(0)) (By) 
СЯ (By). -Оо0» | —GBy), (@Br)a ст Бр 
Sum 0 0 
Sum 0 0 0 0 


LLL ce e e e rs 
813 Numerical Example of 3 x 2 x 2 Factorial Experiment in 
Blocks of Size 6 


Data for this type of design will generally take the form given in Table 
8.13-1. The observation is given opposite the symbol for the treatment 


Table 8.13-1 
Rep! Rep 2 Rep 3 
Block 1 Block 2 Block 3 Block 6 
(000) 4| (001) 10 (001) 5 (000) 3 
(011) 5| (010) 20 (010) 10 (011) 1 
(101) 15| (100) 5 (100) 15 (100) 0 
(110) 20] (111) 10 (111) 15 (11) 5 
(201)20| (200) 10 (201) 10 (201) 20 
Q10)10| (211) 1 (210) 5 (210) 5 


P,=74| Р,-5 P, = 60 Р, =34 6-300 


combination. The make-up of the blocks is that given in Table 8.12-1. In 
the behavioral sciences the blocks may correspond to an individual or to a 


group of individuals. іе 097" 
Summary data for the factorial effects are given in Table 8.13-2. From 


part iv of this table one obtains Ky, and K, , 
Ko, = 60 + 50 = 110, 
К, = 90 + 100 = 190. 
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Table 8.13-2 
bo ы 
со сі со сі 
| 10" 201-407 10 80 
(i) a 20:0 930 30 30 110 


bo by со сі Co сі 
49 | 30 50 4| 50 30 by | 60 100 
(i) «| 50 60 (iii) 4| 50 60 (iv) 4 | 90 50 
а, | 80 30 аз | 50 60 150 150 
160 140 150 150 


From the block totals in Table 8.13-1, one finds 


Р, + Py + P, — P, — P,— Р, 56. 


P; +P, + P, — Pi — P, Б, 
3 


Hence Q = Ky, — Ki, 
=й eso 
3 
= 86133 


Having the numerical value of Q, one obtains the adjusted sum of squares 
for BC from the relation 


5560 = 


2 т! 2 
Q' (61.33) — 117.54. 
32 


32 


Expressions of the form 3(Ky;, — К\,) are obtained from part i of Table 
8.132. 


3(Koo, — Kyo) = 3[(10 + 10) — (20 + 40)] 120, 
ХҚК, — Ky.) = 3[(20 + 30) — (30 + 30)] = —30, 


3(Koo, — Къ) = 3[(30 + 10) — (50 + 20)] = —90. 
Adjustments for the above expressions are respectively, 


(adjay) = P, + P, Pg — Pg — P4 — Р, 20 


(adj а,) = P, за Бе o Py Р, 0 


(adj a) = Р, + P, 4 P ы P 36 
Sum = P, + P, +P, — P, — P, — P, = —56 
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The A's required for computing 55 с are given by 


ЗК, = 3(Keo, — Kio.) — (adj a) — Q = —38.67 
ЗК, = 3(Ku. Ku)-— (adja) — Q— 31.33 


3R: = 3(Ko2. — Kis.) (adj а) — Q 7.33 
X(3R) = 3(—80) — (—56) — 3(—61.33) 01 
Within rounding error, Z(3R) = 0. 
Table 8.13-3 

(1) = 65/36 = 2500 (7) = [X(AC)/6 = 2600.00 

(2) = Ух? = 3618 (8) = [X(BC)]/9 = 2688.89 

(3) = (®А4®)12 = 2550.00 (9) = [Е(АВС)?]]3 = 3066.67 

(4) = (582)/18 = 2511.11 (10) = (2P*)/6 = 2721.33 

(5) = (ЕС?)18 = 2500.00 (11) = Œ Rep?)/12. = 2620.67 

(6) = ГЕ(АВУУ/6 = 2800.00 
ЕТІ = (10) — (1) = 221.33 
SSreps = (11) — (1) = 120.67 
SSpiocks w. rep ЖР" (10) — (11) = 100.66 
55у, block =% = (10) = 896.67 
SS, = (3) - (1) = 50.00 
55в = (4) – (1) = 11.11 
550 = (5) - (1) = 0 
554в = (6) — (3) — (4) + (1) = 238.89 
5840 = (7) – (3) – (5) + (1) = 50.00 
SSpo(unadj) = (8) — (4) — (5) + (1) = 177.78 
SS po(adj) = SSko = (117.54) 
SS4no(unadj) = (9) — (9 — (7) — (8) + (3) + (0 + (5) – (1) = 38.89 
SS4zc(adj) = SS% gc = (42.18) 
SSres(unadj) = (2) — (9) - (10) + (1) = 330.00 
SSres(adj) = SSyes(unadj) + SSzc(unadj) + SS4no(unadj) 

= 386.95 


— 55 во(адј) - 584 вс(ай]) 


The adjusted sum of squares for ABC is 


2 
55 шо = хк) = 42.18. 
n of all sums of squares required in the 
Table 8.13-3. The residual term in this 
subtracting the total of the adjusted 
w. vox OF by the method indicated іп 


A summary of the computatio 
analysis of variance is given in 
analysis may be obtained either by 
treatment sums of squares from 55 
the table. 

The relative efficiency of a 
with a completely randomize 
magnitudes of MSpiocks w. гер 
sideration these mean squares are, 
smaller the latter mean square relative to t 
design involving the smaller block size. 


design involving small block size as compared 
d design depends in large part upon the relative 
and MS,, (adj. For the data under con- 
respectively, 33.55 and 20.37. The 
he former, the more efficient the 
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To illustrate the computation of the adjustments for the BC means, 
(adj ВС) = (8)(Р; + Ps + Ps) + (Ps + Pa + Pe) — (зь)0 — (HG 
(6)(178) + (3)(122) — (—1.92) — 75 
29.67 + 40.67 + 1.92 — 75 2.74. 
The adjusted mean is 


ВС = ВС» 


(Rd (сол) 
9 9 
The adjustment for BC), will be numerically equal but opposite in sign to 


the adjustment for ВС. Thus 


— — st 
BC, = ВС, E ТЕП М 10.81. 


6.97. 


The restrictions on the parameters underlying the linear model for a 

3 x 2 x 2 factorial experiment as shown in Table 8.12-3 may be illustrated 

numerically through use of the data in Table 8.13-2. These restrictions 

also hold for estimates of these parameters. For example, assuming no 
confounding, 

АВСур АВ АСу _ ВС» | Ay , Bo 0-6 


(но = 6 6 9 * 12 19 ' 18 36 


= —1.11. 


According to Table 8.12-3, (20) = —(aBy)ooo. Тһе latter parameter is 
estimated by 


А А ВСоо\ А Boo A Со BC Ao B, Co G 
71 Кы 
(Boo = 6 6 9 '12 18 18 36 
= 11. 
Thus (Ву = — (Ву) = («бу = —1.11. 
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The principles used in the construction of the 3 x 2 x 2 factorial design 
for block size 6 may be extended to covera3 x 3 x 3 x 2 factorial experi- 
ment with block size 6. To illustrate the construction of this design, the 
following notation will be used: 


Li; = set of all treatment combinations satisfying both the relations 
Xi + хә = i (mod 3) and ху + xy = j (mod 3). 
For example, the fourth factor being disregarded, the treatment combina- 
tions in the set Lo have the form (000—), (122—), and (211 —) Asanother 
example, the treatment combinations in the set Ly, have the form (012 —), 
(101 —), and (220—). For each of the treatment combinations in the set 
Ly, the relations x, + x, = 1 (mod 3) and x, + хз = 2 (mod 3) are satisfied. 
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Each of the sets L; consists of three treatment combinations at level dy and 
three at level d. 

The design in Table 8.14-1 represents a partially balanced design. The 
actual treatment combinations in this design are given in Table 8.142. А 


Table 8.14-1 


Level of D | Block: 1 2 3 4 5 6 7 8 9 


dy Loo Loy Los Lio Ly Li Lao Ly Lag 
d, Lis Loz Ly Іш Із La Lo Loo Lio 


Table 8.14-2 
Block: 1 2 3 4 5 6 7 8 9 
(0000) (0010) (0020) (0100) (0110) (0120) (0200) (0210) (0220) 


(1220) (1200) (1210) (1020) (1000) (1010) (1120) (1100) (1110) 
(2110) (2120) (2100) (2210) (2220) (2200) (2010) (2020) (2000) 


(0121) (0221) (0111) (0011) (0201) (0211) (0021) (0001) (0101) 

(1011) (1111) (1001) (1201) (1121) (1101) (1211) (1221) (1021) 

(2201) (2001) (2221) (2121) (2011) (2021) (2101) (2111) (2211) 
ааыа ш d 


minimum of four replications is required for complete balance. To con- 
struct the latter, the defining relations for i and j may be changed; i.e., the 
relations 

xı + Ха = i (mod 3), ху + 2x, = j (mod 3) 
will define sets of L’s which could make up a second replication. The 
assignment of the L’s defined in this manner would be identical to that 
shown in Table 8.14-1. н pow 

The design given in this table has the following restrictions imposed upon 
the assignment of the L’s to the blocks: (1) All the possible Ls occur once 
and only once at each level of factor D. (2) Within each block a subscript 
does not occur twice іп the same position; that is, Loi and L4, cannot occur 
within the same block, since the subscript zero would be repeated within 
the same position within the same block. (3) The sum of the subscripts 
(modulo 3) for L’s within the same block must be equal. For example, in 
block 9 one finds Гэ» and La in the same block; the sum of the subscripts 
for Lə is 2 +2 = 1 (mod 3), and the sum of the subscripts for Lio is 
1 4-0 = 1 (mod 3). 

In the original definition of Li;, the relations used correspond to the AB 
and AC components of the A x B and A x C interactions, respectively. 
The relative information on these components is ł}. Тһе generalized in- 
teractions of the AB and AC components аге ВС? and AB*C?, The relative 
information on BC? is 2, but there is no within-block information on the 
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АВ?С? component. The ABC, ACD, and ВС?” components are also 
partially confounded with block effects; the relative within-block informa- 
tion on each of these components is }. However, ће 482С%) component 
is not confounded with block effects. 

To show that the design in Table 8.14-1 provides no within-block 
information on the 4B?C? component, it is necessary to show that no 
comparisons which belong to this component can be obtained from 
information provided by a single block. The treatment combinations 
defining the АВС? sets are located in the following blocks: 


Blocks 
хі + 2x4 + 2x3 = 0 (mod 3) 1,6,8 
хі + 2x + 2x = 1 (mod 3) 2 
хі + 2x9 + 2x3 = 2 (mod 3) 2, 4, 9 


Thus no single block contains treatment combinations which belong to more 
than one of the АВС? sets. Hence any comparison between such sets will 
involve differences between blocks. 

The picture with respect to the 4B*C? component may be contrasted with 
that presented by the 4B component, for which there is 2 relative within- 
block information. The blocks which contain treatment combinations 
belonging to the АВ sets are the following: 

Blocks 
Jo: xı + x, = 0 (mod 3) 1:42 
Л: xı + x = 1 (mod 3) 103 
Ja: xı + x = 2 (mod 3) 2,5. 
Thus within-block information on the AB componen: 
following comparisons in the blocks indicated: 


, 


is available on. the 


сш 
et ме рь 
PAN 
оо 


Blocks 
J versus J; 1, 3,4 
Jo versus Jy 2, 7, 8 
J, versus Jy 5,6,9 


In the design given in Table 8.14-1 each J set occurs in the same block with 
each of the other J sets three times. 

_ Procedures for obtaining the adjusted sums squares for the two-factor 
interactions which are partially confounded with block effects will’ be 
illustrated by working with the AB (or J) component. The symbol J will 
be used to represent the set of treatment combinations which satisfy the 
relation x, + x, = 0 (mod 3). For the design in Table 8.14-1, these 
treatment combinations occur in cells of the form Loz, that is, cells in which 
the first subscript of an Lis zero. The symbol J,, will be used to designate 
the sum of all treatment combinations in the set Jo. In terms of the 
parameters of the general linear model, 


2Jo, = 2Y Lo; = 36и + 36(аф),, + 6(ar + т» 4 тз + Ta + Tz + 74). 
3 
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In words, the right-hand side of the last expression indicates that the total 
Jo, is partially confounded with block effects. The symbol EP; will be 
used to designate the sum of totals for blocks in which treatment combina- 
tions belonging to J, are located. Thus 


ХР, = P, + Pa + Ps + Py + P, + Р. 
In terms of the parameters of the general linear model, 
EPL = 36и + 9(«В)у„ + б(ту + т + ту + 74 + 7; + т). 

A quantity convenient for use in obtaining the adjusted sum of squares for 
the J component of A x Bis 

20 = 27, — ЕР, = 27(В) у. 
The coefficient of (26), is 36 in the 27, total and 27 in ће 20, total. The 
ratio 55 = } gives the relative within-block information for the AB com- 
ponent. 

The other quantities required for the computation of the adjusted sum of 

squares for the AB component are 

20у, = 2J,, — ХР; = 27(«8)у,, 
where P, represents а total for a block containing treatment combinations 
belonging to the set J}, and 

20,, -2/,- ЖР, = 27(«В) ,. 


The adjusted sum of squares for the AB component is given by 


2 
бал = 20020 = (ара), 


(The error component has been disregarded in this last expression.) If the 
AB component were not partially confounded with block effects, the sum of 
squares for this component'would be given by 


WY РОЛ. адад, 


[The ratio of the coefficients of Х(26)° provides the formal definition of 
relative within-block information. In this case the ratio is GOL 18 = ] 
Adjustments for the AC component of 4 x C are obtained in an analo- 
gous manner. If K; designates the set of treatment combinations which 
satisfy the relation x, + хз = j (mod 3), and if К; designates the sum of all 
Observations on treatment combinations belonging to the set K;, then 


20g, = 2K;.— X Pr, = 27(7)x;- 


For example, 20x, = 2Ko, — ОР, = 21(&у)к,. 
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The adjusted sum of squares for the AC component is 


У(20к,)° 
108 


Adjustments for the BC? component may Бе cast in the following form: 


20(всЭһ = XBC?),, + ®Р(вогу„ == 27(Ву?)„. 

In this expression (BC?),, designates the sum of all observations on treat- 
ment combinations which satisfy the relation x, +- 2x4 = m (mod 3), and 
Р, воз) designates a total for a block containing treatment combinations 
which belong to the set (BC?),,. Blocks which have treatment combina- 
tions in the latter set are those containing L,,’s which satisfy the relation 
i + 2j = m (mod 3). For example, treatment combinations in the set 
(BC?) are included in Loo, Los, and Ly, ; these L’s are located in blocks 12:3, 
5,8, and 9. Hence 


XP(goy, = Py + P, + Ps + P; + Pa + Ps. 


The process of obtaining the adjustments for the three-factor interactions 
which are partially confounded with block effects is simplified by utilizing 
the following restrictions on the underlying parameters, 


(29), ata, + (афбул,ша, = 0, 

and hence (оВд) ata, = — (98), ata, = («P6),,. 
Analogous restrictions hold for Л and J}. 

(apò), ata, = — (980), ata, = (аб). 

(280), ака, = —(#B5)y, ata, = («06),,.. 
If there were no confounding with blocks, the difference between the totals 
Jo, at a, ANd Jo, at a, would be a function only of the term (xf) y, This 
result follows from the basic definition of a three-factor interaction; in this 
case the ABD interaction is a measure of the difference between AB profiles 
at the two levels of factor D. Since the AB component is partially con- 


founded with block effects, the AB x D component will also be partially 
confounded with block effects. 


SSkuc = = (ауды. 


2Jo. ака, — 2Jo. ata, = 36(а08),, + 6(m, + ma + m, — m, — та — па). 
The last term in the above expression represents the block effects with which 
the three-factor interaction is confounded. Blocks containing Lo; at level 
d, appear with positive signs; blocks containing Гу at level d, appear with 
negative signs. То adjust for block effects, 
UP iy, ша, = DP atleveld, = 27(В%) з, + 6m, + Ta + Tg — 74 — T} — ту). 


The expression EP, , at a, denotes the sum of totals for all blocks which 
contain Lo; at level 4. The right-hand side of the above expression makes 
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use of the fact that («d),, + (00), + (89), = 0. An estimate of («f6),,, 
which is free of block effects, is given by 

2Ry, = 2Jo.ata, — 2Jo.ata, — (Раша, — UP Ly ata) 
= (abò) у. 
If there were no confounding with blocks, the coefficient of («fô);, would 
be 36. The ratio ~ = } gives the relative within-block information on the 


AB x Dcomponent of the A x B x Dinteraction. Other quantities re- 
quired in order to compute 554, p are 


26), = 2J1.atà, — 2Л. ака, — (EP Lata, + EPL, аа) 


= 9(x50) y,» 
2Ry, = We, ata, — Wo, ata, — (Рі, на, + ЭР, ua) 
= 9(a/0),.. 
The adjusted sum of squares for AB x Dis 


XOR? 
58 вир = сты = 9X(250);,,. 


Quantities required for the adjusted sum of squares for the AC x D com- 
ponent have the form 


2Rx, = 2Kj.ata, — 2Kj.ata, — (ХР, atdy + УР, ча) 
= 9(«уб)к,. 
The corresponding quantities for the BC? х D component have the form 
2Ё(всзу„ = (BC?) ata, — (BC?)nata, — (ХР(всзаша, + ХР(всЗаша)) 
= 9(8у°д)„. 
8.15 Fractional Replication 


In the designs considered up to this point, all the possible treatment com- 
binations in the factorial set were included in the actual experiment. The 
number of treatment combinations in a complete factorial set becomes quite 
large as the number of factors increases. For example, a 25 factorial 
experiment requires 256 treatment combinations; a 216 factorial experiment 
requires 65,536 treatment combinations. If higher-order interactions can 
be considered negligible relative to main effects and lower-order inter- 
actions, only a selected fraction of the complete factorial set needs to be 
included in an experiment. The cost to the experimenter is the confound- 
ing of higher-order interactions with main effects and lower-order inter- 
actions. The gain is usually a substantial reduction in experimental effort, 
accompanied by a somewhat broader scope for the inferences. In cases 
where ambiguity is present because of confounding, the initial experiment 
may be supplemented by follow-up experiments specifically designed to 
clarify such ambiguities. There are situations in which it is highly 
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desirable to run a sequence of fractional replications, the choice of the 
successive fractions being determined by the results of the preceding 
fractions. In the designs to be discussed in this section, all factors are 
considered to be fixed. 

The principles for selecting the set of treatments which will provide 
maximum information on main effects and lower-order interactions are 
essentially those followed in assigning treatments to blocks in a complete 
factorial experiment. To illustrate the kind of confounding which arises 
in a fractional replication, consider a one-half replication of a 2? factorial 
experiment. Suppose that the treatments in this one-half replication 
correspond to the treatments in the set (АВС). The latter are (000), (011), 
(101), (110). Comparisons corresponding to the main effects and inter- 
actions may be indicated schematically as follows (the columns indicate the 
weights for a comparison): 


A B с АВ АС ВС АВС 
(000) =(1)} - = = = = = = 
(O11) = be = s a zt + = = 
(101) = ac Е a + aT cT + d 
(110) — ab F E: = = F + = 


Note that the pattern of the comparison corresponding to the main effect of 
A (the column headed A) and the pattern of the comparison corresponding 
to the BC interaction (the column headed BC) are identical. (These two 
patterns would not continue to be identical if the remaining treatments in the 
complete factorial were added.) Hence, if only information on these four 
treatment combinations is available, variation due to the main effect of A is 
completely confounded with variation dueto the BC interaction. Similarly, 
by using information from the one-half replication given above, B and 
AC are completely confounded; C and AB are also completely confounded. 
The sign pattern of ABC is not that of a comparison, it will be found that 
variation due to ABC cannot be estimated. 

The effects which cannot be distinguished in a fractional replication are 
called aliases—the same source of variation is, in essence, called by two 
different names, The aliases in a fractional replication may be determined 
by means of a relatively simple rule. Consider a one-half replication of a 


24 factorial experiment defined by the relation (ABCD). Тһе alias of the 
main effect of A is given by 


A x ABCD = A*BCD = BCD. 


BCD is the generalized interaction of A and A BCD; it is also the alias of A 
in a one-half replication defined by (ABCD), or (ABCD),. The alias of 
ABis 

AB x ABCD = А?В?Ср = СР. 
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The alias of ABC is the generalized interaction of ABC and the defining 
relation, i.e., 
ABC x ABCD = D. 


In general, the alias of an effect in a fractional replication is the generalized 
interaction of that effect with the defining relation or relations, should there 
be more than one. 

The analysis of variance for a one-half replication of a 24 factorial experi- 
ment, by using (4BCD), or (4BCD), as the defining relation, is given in 
Table 8.15-1. Тһе aliases are indicated. (It will be noted that, if BCD is 
the generalized interaction of A and ABCD, then A will be the generalized 
interaction of BCD and ABCD.) Main effects are aliased with three- 


Table 8.15-1 One-half Replication of а 2* Factorial Experiment 
Beles a ee eee ss EE EE 
Source df 


A (BCD) 1 
B (ACD) 1 
С (ABD) 1 
D (ABC) 1 
1 
1 
1 


AB (CD) 
AC (BD) 
AD (BC) 
Within cell 8(n — 1) 


Total 8n—1 
—— i 


factor interactions, and two-factor interactions are aliased with other two- 
factor interactions. The four-factor interaction cannot be estimated. If 
three-factor and higher-order interactions may be considered negligible, 
then estimates of the variance due to the main effects are given by this 
fractional replication. There is considerable ambiguity about what inter- 
pretation should be made if two-factor interactions should prove to be 
significant, since pairs of two-factor interactions are completely con- 
founded. ү 
By way of contrast, consider a one-half replication of a 2° factorial 
experiment. Suppose that the treatments are selected through use of the 
relation (4BCDE), or (4BCDE),. The analysis of variance may be 
outlined as indicated below (aliases are indicated in parentheses): 


nt 


Source df 
Main effects (4-factor interactions) À 5 
Two-factor interactions (3-factor interactions) 10 
Within cell 16(n — 1) 
Total 16n—1 


NN 
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If three-factor and higher-order interactions are negligible, this fractional 
replication provides information for tests on main effects as well as two- 
factor interactions. 

A single defining relation for a 2* factorial experiment will select a one- 
half replication. If kis large, it is desirable to have a one-quarter or even a 
one-eighth replication. In order to select a one-fourth replication, the 
treatments are required to satisfy, simultaneously, two defining relations. 
Suppose that these defining relations are designated U and V. The aliases 
of an effect E will have the form 


E x U, ExV, E x (U x V). 


In words, the aliases of an effect in a one-fourth replication are the general- 
ized interactions of that effect with each of the defining relations as well as 
the generalized interaction of the effect with the interaction of the defining 
relations. 

For example, consider a one-fourth replication of a 2* factorial experi- 
ment. Suppose that the defining relations are (4BCE), and (ABDF)p. 
The generalized interaction of these two relations is (CDEF),. А partial 
list of the aliases is as follows: 


———————D 


Effect Aliases 
A BCE, BDF, ACDEF 
B ACE, ADF, BCDEF 
AB CE, DF, ABCDEF 
CD ABED, ABCF, EF 


—————————e 


In this case, main effects are aliased with three-factor and higher-order 
interactions; two factor interactions are aliased with other two-factor as 
well as higher-order interactions. 

As another example, consider a one-fourth replication of a 27 factorial 
experiment. Suppose that the defining relations for the selected treat- 
ments are (ABCDE) and (ABCFG), Тһе generalized interaction of these 
relations is (DEFG). A partial listing of the aliases is as follows: 


Effect Aliases 
A BCDE, BCFG, ADEFG 
B ACDE, ACFG, BDEFG 
AB CDE, CFG, ABDEFG 
DE ABC, ABCDEFG, FG 


In this design main effects are aliased with four-factor and higher-order 
interactions. With the exception of DE — FG, DG EF, DF — EG, all 
other two-factor interactions are aliased with three-factor or higher-order 
interactions. The two-factor interactions which are equated are aliased. 
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This design yields unbiased tests on main effects and several of the two- 
factor interactions, provided that three-factor and higher-order interactions 
are negligible. 

For a one-eighth replication of a 2* factorial experiment, three defining 
relations are required to select the set of treatments. If these relations are 
designated С, V, and W, the aliases of an effect E are given by the following 
interactions: 

Ex U, BOK АМЕ; 
Е XU уу; ЕХ" СӨРЕ Ex (V x W), E (U xv x ҮР); 
In general, if m is the number of defining relations, then the number of 
aliases of an effect is 


m+ (3) + (3) +--+ (0) 


For example, when m = 3, 
3 3 
3+ (3) +4 (3) =3+341=7 


Fractional Replication in Blocks. To illustrate the general method 
for arranging the treatments ina fractional replication into blocks, consider 


Table 8.15-2 One-half Replication of 2° Factorial 
[Defining Relation: (4BCDEF),] 


Block 1 Block 2 
(ABC), (ABC), 
(1) ab ac be ae af ad bd 
(i) abef ef de df bf be ce of 
acde  acdf  abdf асе cd abcd аў ^ abce 
жар bede bcef ^ abde abcdef сае bdef adef 
Block 1’ Block 1^ Block 2^ Block 2^ 
(ABD), (ABD), (ABD), (ABD), 


(1) ab ac be ae af ad bd 
Gi) abef ef de df bf be ce of 

acde  acdf abdf асе | cf abcd abcf ^ abce 

bcdf bede beef ^ abde | abcdef саў bdef adef 


а 2° factorial experiment. Suppose that a one-half replication of 64 treat- 
ments is selected by means of the relation (ABCDEF),. Now suppose that 
the resulting 32 treatments are subdivided into two sets of 16 treatments by 
means of the relation ABC, one set being (ABC), and the other (ABC),. 
The resulting sets of 16 treatments are given in parti of Table 8.15-2. Sup- 
pose that the latter sets define the blocks. In the analysis of the resulting 
experiment, main effects will be aliased with five-factor interactions. 
Two-factor interactions will be aliased with four-factor interactions. 
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Three-factor interactions will be aliased in pairs as follows: 


Effect Alias 
ABC DEF 
ABD CEF 
ABE CDF 


The pair ABC = DEFis confounded with differences between blocks. 

Suppose that the blocks of 16 are further subdivided into blocks of size 
8 by means of the relations (ABD), апа (ABD),. The resulting blocks are 
given іп part ii of the table. The aliases remain the same as in the previous 
analysis, but now there is additional confounding with blocks. The three 
degrees of freedom confounded with between-block differences are 

ABC = DEF, ABD = CEF, 
as well as 
ABC x ABD — CD. 

Note that the generalized interaction of DEF and CEF is also CD. 

Thus in a one-half replication of a 28 factorial experiment in blocks of size 
8, if three-factorand higher-order interactionsare negligible, maineffectsand 
all two-factor interactions except CD may be tested. Some of the three- 
factor interactions as well as CD are confounded with block effects, but 
other pairs of three-factor interactions are clear of block effects, provided 
that the appropriate model is strictly additive with respect to block effects. 

Computational Procedures. Computational procedures for a one-half 
replication of a 2* factorial experiment are identical to those of a complete 
replication of a 2* factorial experiment. For example, the eight treat- 
ments in a one-half replication of a 24 factorial experiment actually form a 
complete replication of a 2? factorial experiment if the levels of one of the 
factors are disregarded. If the one-half replication is defined by the 
relation (4 BC DY,, the treatments are 


(1), ab, ac, ad, be, bd, cd, abcd. 
If the levels of factor D are disregarded, the treatments are 
(1), ab, ac, a, be, b, c, abc; 
these are the eight treatment combinations in a 2? factorial experiment. 


Corresponding effects in the three-factor and four-factor experiments are 
given below: 


EI S P UD TN IRR 


Four-factor experiment 


Three-factor experiment (one-half rep) 
A А = BCD 
В В = АСр 
c C — ABD 
AB AB — CD 
AC AC — BD 
BC BC = AD 


ABC ABC = D 
Se 
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Suppose that a one-quarter replication of a 25 factorial experiment is 
defined by the relations (ABE), and (CDE). Тһе treatments in this frac- 
tional replication are 


(1), ab, cd, ace, bce, ade, bde, abcde. 
If factors D and E are disregarded, the treatments are 
(1), ab, с, ac, be, a, b, abc; 
these are the treatments in a complete replication of a 2? factorial experi- 
ment. The aliases of these effects are their generalized interactions with 
ABE, CDE, and ABE x CDE = ABCD. Thus there is the following 


correspondence between this 2? factorial experiment and the one-quarter 
replication of a 2° factorial experiment: 


Five-factor experiment 
(one-quarter rep) 


A = BE, ACDE, BCD 
B = AE, BCDE, ACD 
C = ABCE, DE, ABD 
AB = E, ABCDE, CD 
AC = BCE, ADE, BD 
BC = ACE, BDE, AD 
ABC = CE, ABDE, D 


That is, if a one-quarter replication of a 2° factorial is analyzed as if it 
were a complete replication of a 2* factorial, the corresponding effects 
are indicated—thus the analysis reduces to the analysis of a complete 
factorial having two fewer factors. 

Extensive tables of fractional replications of experiments in the 2" series 
are given in Cochran and Cox (1957, pp. 276-289). The plans in these 
tables permit the arrangement of the treatments into blocks of various sizes. 
To avoid having main effects and two-factor interactions aliased with lower- 
order interactions, care must be taken in selecting the defining relations for 
fractional replications and blocks. The plans tabulated in Cochran and 
Cox are those which tend to minimize undesirable aliases. 

Fractional Replication for Designs in the 3° Series. Principles used in 
the 2" series may be generalized to factorial experiments in the p^ series, 
where p is a prime number. In a 3? factorial experiment, a one-third 
replication may be constructed by any of the components of the three-factor 
interaction: ABC, ABC?, АВ?С, or AB*C?. Each of these components will 
subdivide the 27 treatments in a 3? factorial set into three sets of 9 treat- 
ments each. If one of the sets is defined by (ABC),, where i = 0, 1, or 2 
is used, the aliases of the main effect of factor A are 


A x ABC = А?ВС = АВ?С°, 
A? x ABC = BC. 
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The aliases of AB? are 
АВ? x ABC = А?В?С = AC’, 
(АВ?) x ABC = A°BC = ВС = BC, 
In general, if the defining relation for a one-third replication in a 3* experi- 
ment is A, then the aliases of an effect E are 
EXR ай E?*xR. 


The following correspondence may be established between a one-third 
replication of a 3° factorial and a complete replication of a 3? factorial; 
assume that the defining relation is (ABC); 


———M— 


3? factorial experiment 


3° factorial experiment (one-third rep) 
A A = АВ?С? = BC 
B В = АВС = AC | 
АВ АВ = АВС? = С 
AB: АВ? = АС? = BC? 


From the point of view of computational procedures, a one-third replication 
ofa 3° factorial is equivalent to a complete replication of a 3? factorial experi- 
ment. Since two-factor interactions are aliased with main effects, this plan 


Table 8.15-3 One-third Replication of a 34 Factorial Experiment 
(Defining Relation: ABCD) 


Source df | 
А = AB*CD* = BCD 2 
В = АВ?Ср = ACD 2 
С = ABC*D = ABD 2 
AB = АВС?р? = CD 2 
АВ? = AC? p? = BCD? 2 
AC = АРСЫЗ = BD 2 
АС? = AB? D! = BCD 2 
ВС = АВС?р = AD 2 
ВС? = АВ?) = АС?р 2 
ABC = ABCD* = D 2 
ABC? = ABD? = CD? 2 
AB?C = ACD? = BD? 2 
АВС? = Ар? = BCD? 2 
Within cell 27(n — 1) 
Total 271 —1 


тс мез а жен al EUR Me. 


is of little practical use unless it can be assumed that all interactions are 
negligible, 

A one-third replication of a 3* factorial experiment may be constructed 
from any one of the components of the four-factor interaction. If one of 
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the sets of 27 treatments defined by (ABCD), is used, then the corre- 
spondence given in Table 8.15-3 may be established between the fractional 
replication of a 34 factorial and a complete replication of a 3° factorial. 
Main effects are seen to be aliased with three-factor and higher-order 
interactions. Some of the two-factor interactions are also aliased with 
other two-factor interactions. If, however, the two-factor interactions 
with factor D are negligible, then the other two-factor interactions are 
clear of two-factor aliases, 

Assignment of treatments to blocks follows the same general principles 
as those given in earlier sections of this chapter. Plans for fractional repli- 
cations in the 3* series are given in Cochran and Cox (1957, pp. 290-291). 
The following notation systems are equivalent: 


AB(I) = АВ", ABC(W) = АВ?С?, ABC(Y) = АВС”, 
AB(J) = AB, ABC(X) = AB'C, ABC(Z) = ABC, 
Other Fractional-replication Designs. Latin squares and Greco-Latin 
squares may be considered as fractional replications of factorial experi- 


ments. Experimental designs in these categories will be considered in 
Chap. 10. 


CHAPTER 9 


Balanced Lattice Designs and Other 
Balanced Incomplete-block Designs 


9.1 General Purpose 


The designs discussed in Chap. 8 permit the experimenter to assign the 
treatment combinations in a factorial experiment to blocks of size k, where 
k is smaller than the pq treatment combinations in a two-factor factorial 
experiment. In balanced designs, some within-block information is avail- 
able on all main effects and interactions, the information on the interactions 
being in some cases only partially complete. 

The design principles developed in Chap. 8 may be adapted for use in 
assigning the p treatments in a single-factor experiment to blocks of size k, 
where kis smaller than p. In particular, the principles developed in the last 
chapter are useful in constructing balanced lattice designs. In an m-dimen- 
sional balanced lattice the number of treatments is k™, where k is a prime 
number, For example, in a balanced two-dimensional (double) design 
there are К? treatments. In a balanced three-dimensional (cubic) lattice 
there аге k* treatments. The block size in a balanced lattice of dimension 
mis К or k to some power less than m. 

Lattice designs are also classified according to the number of dimensions 
which are in balance, i.e., the number of restrictions on the structure of the 
design. For example, a balanced double lattice may be balanced in only 
one dimension (one restriction) or balanced in both dimensions (two restric- 
tions). A cubic lattice may have one, two, or three restrictions. In this 
chapter balanced double lattice designs having one and two restrictions will 
beconsidered. For partially balanced lattice designs as well as higher-order 
balanced lattice designs the reader is referred to an extensive treatment of 
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requiring within-subject estimates of treatment effects without the require- 
ment that each subject be observed under all treatments. Each subject is 
required to be observed only under k of the p treatments. The analysis of 
balanced incomplete-block designs assumes that a strictly additive model is 
appropriate; i.e., no interaction terms are included in the model. Also, in 
these designs the block effects are assumed to be random, and the treatment 
effects are assumed to be fixed. However, all the assumptions about con- 
stant correlations їп а repeated-measures experiment must be met if repeated 
measures are used. 


9.2 Balanced Simple Lattice 


A balanced double lattice having one restriction is called a simple lattice. 
In this design К? treatments are arranged in blocks of size k, where k is a 
prime number. The treatments must be assigned to the blocks in such a 
way that each possible pair of treatments occurs together in some block an 
equal number of times. If this condition is met, the within-block informa- 
tion on all pairs of treatments is equal—in this sense, the design is balanced. 
The minimum number of blocks required to achieve balance is k(k + 1). 

The construction of the blocks is simplified by associating each of the К? 
treatments with one of the treatment combinations in a k x k factorial 
experiment, The blocks are then constructed in terms of the treatment 
combinations of the factorial experiment. In terms of this latter type of 
experiment, the effects confounded with blocks are more readily identified. 

A simple balanced lattice in which К? = 9 will be used for illustrative 
purposes. Consider the design given in part i of Table 9.2-1. Differences 
between blocks within replication 1 are confounded with the main effect of 
pseudo factor A. Blocks in replication 2 are confounded with pseudo 
factor B. In replications 3 and 4, differences between blocks are con- 
founded, respectively, with the AB and AB? components of the interaction. 
Thus, in terms of the pseudo factors, there is } within-block information on 
each of the main effects as well as $ within-block information on each of 
the components of the interaction. In general the within-block infor- 
mation per component of a pseudo factor is k/(k + 1). 

The translation from a design in terms of pseudo factors to a design in 
terms of treatments in а single-factor experiment is given in part іі. Any 
mode of associating the symbol (ij) with a treatment may be used so long as 
the method of association is consistent for all replications. In part ii it 
may be verified that each possible pair of treatments occurs in the same block 
once and only once. For example, the pair (1,5) occurs only in block 10; 
the pair (7,9) occurs only in block 3. Balanced lattices for К® = 9, 16, 25, 
49, 64, and 81 have been constructed. No balanced lattice exists for 
k? = 36; none has been constructed for К? = 100 or 144. 

Since all the treatments do not appear in the same block, treatment effects 
are partially confounded with block effects. Conversely, block effects are 
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Table 9.2-1 Balanced Simple Lattice for Nine Treatments in Blocks 


of Size 3 
Block | Replication 1 Block| Replication 2 
1 |(00) (01) (02) x, —0 4 |(00) (10) (20)  x,—0 
2 (10) (11) (12) 4-1 5 (01) (11) (21) X; = 1 
3 | (20) (21) (22) x, =2 6 | (02) (12) (22) ж =2 
(i) Block Replication 3 Block Replication 4 


7 |(00) (12) (21) x, +х»=0 10 
8 |(01) (10 Q2) x, x, —1 11 
9 |02) (11) Q0) x, -x,—2 12 


(00 (11) Q2) x, + 2x, — 0 
(02) (10) Q1) x, + 2x, — 1 
(01) (12) (20) x, + 2x, 


Block | Replication 1 Block| Replication 2 

1 1 2:143 4 1 А p. 

2 42225556 5 ола ел ыд 

3 Jin e ec 6 S GET 

(ii) Block Replication 3 Block Replication 4 

cac See ыу... esit NC. RS 

7 1 бк «8 10 1 $4299 

8 2 а, 11 ЗЕ 1g 

9 20 mS eai. 12 177%”) 


also partially confounded with treatment effects. The primary purpose of 
this type of design is to obtain estimates of treatment effects which are not 
biased by block effects, To achieve this objective, one proceeds in a rather 
roundabout manner—estimates of block effects which are not biased by 
treatment effects are obtained first, This is done ina preliminary analysis 
of variance which takes the following general form: 


Source df MS 


Replications k 

Treatments (unadj) 1 — 1 

Blocks (adj) w. rep (К — 1)(k + 1) E, 

Intrablock error (Е — 1)(k? — 1) E 
Total 


effects is generally designated by the symbol E,; the latter mean square is 
also referred to as the between-block error term. Similarly, the symbol 
E, is widely used to indicate the within-block, or intrablock, error term. 
The expected value of these mean squares has the following general form: 


kè 
E(E,) = o? + bai Shocks; — E(E)— аё. 


| 
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Some of the principles underlying the adjustment of the block effects will 
be illustrated in terms of the design given in Table 9.2-1. Toward this end, 
the following notation will be used: 


T, = sum of the k + 1 = 4 observations on treatment i. 
B, = sum of the k = 3 observations within block j. 
Ba, = УВ, = sum of the block totals in which treatment i appears. 
(i) 


In terms of a strictly additive model, 
Т, = Au + 4r, + Ву + Ba В, + зо. 
(Terms involving е are not included.) Since treatment 1 appears in blocks 
1,4, 7, and 10, the corresponding block effects appear on the right-hand side 
of the above expression in addition to the direct effect associated with treat- 
mentl. The totals for the blocks in which treatment | appears estimate the 
following parameters of a strictly additive model: 
B, = Зи + ті Ет Ат: + 3, 
В, = 3p +r, + Ta +77 + 3%, 
B, = Зи + Ti + Te + Te + 3%, 
Bio == 3p + тү + т; + ту + 300 
In the model, Ут; = 0; hence, 
Bay = В, + B, + By + Bio = 12 + Эту + 3(В, + Ba + Br + Bio) 
The grand total of the k?(k + 1) = 36 observations in the experiment pro- 
vides an estimate of 364. Hence, 
W, = ЗТ, — 4Bqy + G = —9(y + В, + В; + Bro) 
That is, W, is a function of the sum of block effects in which treatment 1 
appears. In general, 
W, = КТ, — (k + Bo + G = me d 


Thus each W, provides an estimate of a quantity which is a function only of 


block effects. б М 
An estimate of the effect of treatment i, adjusted for block effects, is 


given by 
0; = kT, — By = кт. 
If there were no confounding, 
kT, — КК + 1)6 = k(k + 1)т,. 
The relative within-block information, or efficiency, is 
2 
cioe) (ш es 
k(k--1 k+1 
The quantity О, provides an intrablock estimate of treatment effect т). 
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The estimate of variation due to block effects, adjusted for treatments, is 
given by 

xw? 
Ik + 1)" 
The expected value of the mean square corresponding to this source of 
variation may be shown to be 


SSpiocks(aajy = 


ЙН 
Е(Е,) = o? оў. 
(Е,) бача 17 


The other sources of variation in the preliminary analysis are obtained as 
follows: If А, is the sum of the К? observations in replication m, 


ER? с? 
SSreps = = Warr ial. 
k dk) 
The variation due to the treatments, the blocks being disregarded, is 
ET? с? 


SSireat(unadj) = asa == I D Й 
The total variation is given by 
mi с? 
SStotar = УХ? — Rk +1) . 
The intrablock error is obtained by subtraction. 
SSintrapiock = SStota1 — SSreps — SStreat(unadj) — SSiocks(adj): 


Information for estimating treatment effects comes from two sources. 
The primary source is that obtained from differences between pairs of treat- 
ments in the same block—this source is known as the intrablock, or within- 
block, information. The error associated with estimates obtained from 
intrablock information is E,, the intrablock mean square. The secondary 
source of information on the treatment effects is obtained from differences 
between the В"; such information is called interblock treatment informa- 
tion. The error associated with this source of information is E,. (Usually 
E, will be considerably larger than E,.) 

In general, given two estimates т, and m, of the same parameter with 
respective variances o? and o3, the best linear unbiased estimate of the pa- 
rameter has the form 


2. (т,/03) + (т,/02) 
(1/91) + (1/63) 
Іп words, the best unbiased linear estimate is a weighted average of the 


individual estimates, the weights being the reciprocals of the respective 
variances. This last expression is algebraically equivalent to 


т = m, + (т; — m), 


2 
where u= rey 
ded 
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If o is large relative to ої, и will tend to be near zero. In this case m, will 
not differ appreciably from m. The conditions under which т is а uni- 
formly better unbiased estimator of the parameter estimated by т, and ma 
are discussed by Graybill and Deal (1959). 

This general principle for combining two estimates may be used in com- 
bining intrablock and interblock treatment information. In the latter case 
the best linear unbiased estimate of the sum of the effects due to treatment i 
has the form 


E, — E, 

ME, ` 
In this last expression, the term W, is not of the form m, — ті; hence 
и does not have the same definition as that given in the last paragraph. 
(Since the parameter which и estimates is a ratio of variances, the parameter 
cannot assume a negative value. If u should be negative, 4 is assumed to be 
zero.) When E,islarge relative to E,, only the within-block information on 
the treatments need be taken into account. As estimate of the latter is 
given by 


T; = q + pM, where и = 


"ES 22081 : 
T; meee PE 06: 


i 


The error variance associated with this estimate is E,. 
The error variance associated with estimates which utilize both intra- and 
interblock information is 
E; = E,(1 + Ки). 
E/ is the effective error mean square, and its degrees of freedom are 
taken as being numerically equal to the degrees of freedom of Е,. Indi- 
vidual tests between adjusted treatment totals have the form 
Oy Ta)? 
T X + DE, 
Any of the procedures for making all possible comparisons among treat- 
ment means may be adapted for use here—in this case the adjusted treatment 
means are used, and E; has the role of the error mean square. 
An approximate over-all test on the differences between treatments uses 
ATY (еді 
SStreat(aaj) = ЕНІ — FED Қ 


Тһе F ratio has the form 
_ MStreataas) 
pee 
A simple balanced lattice for the case k? = 16 in blocks of size К = 4 is 


given in Table 9.2-2. The number of replications required for balance is 
Е +1=5. Note that the pair of treatments (1,2) occurs together in the 


F 
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Table 9.2-2 Balanced Simple Lattice for 16 Treatments in Blocks 
of Size 4 


Block Replication 1 Block Replication 2 


same block only once—namely, in block 1. Note also that the pair (6,16) 

occurs together in the same block only once—namely, in block 9. In 

general, cach of the [16(15)]/[1(2)] = 120 possible unique pairs occurs 
block 


in the same once and only once, Since each block contains 
К тоне 6 pairs, the minimum number of blocks required for balance 

M „ 

In this design there are k + 1 = Sobservations on each treatment. If'it 
is desired to have 5n observationsoneach treatment, л sets of k + 1 replica- 
tions may be constructed, The latter design would require 20n subjects. 
More extensive tables of balanced simple lattices are given in Cochran 
and Cox (1957, рр. 428-432). In using these designs, the blocks within a 
replication are arranged in a randomized order, and the treatments within a 

ized order, 
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When the replications are disregarded, the adjusted sum of squares for 
blocks is given by 
55ыоекелйр = $$моекщивайр + $бикацай) tor blocs ~ SSrrear(unaa n 


The unadjusted sums of squares for blocks and treatments are computed as 
if there were no confounding; i.c., 


xs o c 

Shana T cT 
rm 6 
wl 20... 

геч PCIE 


To obtain the sum of squares for treatments adjusted for blocks, one first 
computes quantities of the form 


0,= КТ, — Bo. 
Then SSuestind] for blocks) ze 
The adjusted. treatment. totals, utilizing both intrablock and interblock 
information, are given by 


Ti = T, ИИ, 

(Кф kd ' 

А[д + k — DE, + E,] 

It should be noted that 55, м) ње аа) із different from. SS, а) 


The latter utilizes both intrablock and interblock information; the former 
utilizes only intrablock information. In this case 


„уту c 

несе EF 

The effective error in this case is 
E; = ЕДІ + 4и), 

9.3 Numerical Example of Balanced Simple Lattice 
Computational procedures for this design will be illustrated by means of 
on pee eS cn а тете hacer parcel 
are nine different not considered experi- 
irr ire арр respond to all the pictures. 
subject can, however, to three pictures. The criterion is the num- 
ber of responses ina E eiit un. Tcl 
which the subject corresponds to 


where “= 
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is indicated in Table 9.3-1. The numbers in parentheses indicate the treat- 
ments. 

Four random samples of three subjects each were obtained for the experi- 
ment. The samples were assigned at random to the replications; subjects 
within samples were assigned at random to the blocks. Data obtained are 
shown in Table 9.3-1. For example, subject 1 responded to pictures (1), 
(3), and (2) and had respective criterion scores of 60, 95, and 90. The 


Table 9.3-1 Numerical Example of Balanced Simple Lattice 


(К = 3) 
Subject Subject 
а), 9; .@ (0 а) (7) 
1 16027957: 400” 245 4 160 115 90 26 
(0 (4 (5) (0) (5 (8) 
2 150 125 140 415 5 120 160 85 365 
(8) (С) (9 () (9 (9 
3 70 75 60 205 6 100 175 110 385 
Rep 1 total = 865 ~ Rep 2 total = 111 
Subject Subject 
(5) eG), (0) (0 (8 (6 
7 140: :82 790” 275 10 105 75 150 730 
0) Oo © Q o o 
8 70 65 110 245 11 100 130 80 310 
(6) (5) (4 (5) (7) (3) 
9 2208 oom 956 1 175 12 170 85 120 215 
Rep 3 total — 695 Rep 4 total — 1015 
— ту мени a ЭШ |. жалан МӨ нт беа 
=X? = 426,000 


С°/36 = 378,225 
SStota = 47,775 
АШАБЫ ЕСЕБШЕ ы МЕ 


sequence in which the pictures were presented was determined by a randomi- 
zation procedure. Block totals as well as replication totals are given in 
Table 9.3-1. 

Totals needed to obtain sums of squares are given in Table 9.3-2. The 
first entry in column T, is the sum of all Observations on treatment 1; this 
total is 

60 from block 1 
115 from block 4 
85 from block 7 
105 from block 10 
T, — 365 
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Other entries in column 7; are obtained in an analogous manner. Тһе first 
entry in column Bg is 
Ва) = sum of totals for blocks containing treat 1 
B, + B, + B, + By = УВ, 
4) 


= 245 + 365 + 275 + 330 = 1215. 

The second entry in column В) is 
By) = sum of totals for blocks containing treat 2 

mulu Maw one 

= 245 + 365 + 245 + 310 = 1165. 
The symbol 2% indicates the sum of block totals which contain an entry 
from treatment 2. Asa partial check on the computations, the sum of the 
entries in column Ba should be (k — 1)G. 


Table 9.3-2 Numerical Example (Continued) 


---------------------------------- 


W, = = | a 3 

Treat T; Buy 3T, — 48) + G T; + uW,| ЗТ; — Buy T; 
1 365 1,215 —175 361,0 —120 356.7 
2 380 1,165 170 389.0 —25 398.9 
3 385 1,180 125 391.6 —25 398.9 
4 510 1,265 160 518.5 265 527.8 
5 610 1,430 —200 599.4 400 587.8 
6 585 1,375 —55 582.1 380 578.9 
7 315 1,190 —125 308.4 —245 301.1 
8 250 1,075 140 257.4 —325 265.6 
9 290 1,175 -140 282.6 —305 274.4 
G = 3690 11,070 = 3G 0 3690.0 0 3690.1 


p 


The first entry in column W, is 
W,—kT, —(k+1)Bo +G 
= 3(365) — 4(1215) -- 3690 = —75. 
Other entries in column W; are obtained in an analogous manner. Asa 
partial check on the computations the sum of the entries in column W, 
should be zero. 
The variation due to replications is 


ZR G* 
SSreps = =” — ——— 
rere Ko DD 
8562 + 11157 + 695? + 1015* 36907 
с 9 36 


= 11,186. 
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The variation due to treatments, the confounding with blocks being dis- 
regarded, is 

572,226 

k+1 kK(k+1) 

365° + 380° + +--+ 290? 3690? 


SStreat(unaaj) = 


4 36 
= 33,300. 
The variation due to blocks, adjusted for confounding with treatments, is 
SS ZW? (—75)° ++ (170)? + - - - + (—140)° 
Wm 
blocks(adj) K(k gu 1) 3 4) 
— 1607. 
The total variation is 
` С? 
SStotar = EX? — 041) = 426,000 — 378,225 = 47,775. 


The variation due to intrablock error is obtained by subtraction; i.e. 


SSintrapiock error — SStotai = SSreps чэ SStreat(unaajy => SSpiocks(aaj) 
= 1682. 

A summary of the analysis of variance is given in Table 9.3-3. Since the 
blocks are nested within the replications the degrees of freedom are 
(k — Ik + 1) =8. 

There is an alternative, more general, computational procedure for 


obtaining the sum of squares for blocks adjusted for treatments. It will be 
found that 


SShiocks(aaj) = SSbiocks w. reps + SStreat(aaj tor blocks) — SSireat(unad)). 


Variation due to blocks within replications, confounding with treatments 
being disregarded, is 
УВ ER, 
k ki 
= 10,872. 


Variation due to treatments adjusted for blocks is given by 


ыу 2 
SStreat(aaj tor blocks) = =o = 24,035. 


SSpiocks(aajy = 10,872 + 24,035 — 33,300 = 1607. 


SSpiocks w.reps = 


Thus, 


Within rounding error, the two computational procedures will give 
identical results. 
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The weighting factor и needed to obtain an estimate of the adjusted 
treatment total Т; is 
EVE, | 201= 105 
karpe 9(201) 


= .0531. 


The adjusted treatment totals are given in Table 9.3-2. For example, 


Tj = T, + uW, = 365 + (.0531)(—75) = 361.0. 


This total combines both intrablock and interblock treatment information. 
An estimate which uses only intrablock information is given by 


T= о ++ 06 
= (8)(—120) + 410 = 356.7. 
When E, is large relative to Е,, Т; and Т) will be approximately equal. 


Table 9.3-3 Summary of Analysis of Variance 


Source of variation SS df MS 
Replications 11,186 3 
Treatments (unadj) 33,300 8 4162 
Blocks within reps (adj) 1,607 8 201 = Е, 
Intrablock error 1,682 16 105 = E, 
Total 47,775 35 


Tests оп the difference between two 7;’s, which аге equivalent to tests on 
the difference between corresponding means, require the effective mean 
square for error, 


E; = E,(1 + ku) = 105[1 + 3(.0531)] = 122. 


To test the hypothesis that т, = ту against a two-tailed alternative, one 
obtains the statistic 

(Ti — Ty 18.5 = 257.4? © 6.96 

2(k + DE; 8(122) 


For a .05-level test, the critical value is F (1,16) = 4.49, assuming that this 

test is considered as being a priori. Since the observed F exceeds the 

critical value, the experimental data reject the hypothesis that т, = ту. 
An approximate over-all test on the hypothesis that o? = 0 is given by 


_ MStreat(aas) 


E , 
Е, 
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The sum of squares corresponding to the Тг is 


"T PNE 
treat(adj) = 17 m 1 Kk ut 1) 
= 32,511. 
: _ SStreatiaajy _ 
From this, MStreat(aaj) = mr em 4071. 
Hence F= 4011 2:327 
122 


The critical value for a .05-level test is Е»(8,16) = 2.59. Any of the pro- 
cedures for making all possible comparisons between the adjusted treat- 
ment means, 7) = T7/(k + 1), may be adapted for use here, with E; having 
the role of MS,,,,,. 


9.4 Balanced Lattice-square Designs 


A balanced simple lattice has balance with respect to the rows (blocks) 
but not with respect to the columns (position within the block). А balanced 
lattice square has balance with Tespect to both rows and columns. The 
latter designs are readily constructed by using the same kind of pseudo 
factors introduced in connection with simple lattice designs. Principles 
underlying the analysis of lattice Squares are also most readily understood 
in terms of these pseudo factors, 

An example of a balanced lattice-square design is given in Table 9.4-1. 


Table 9.4-1 Balanced Lattice Square 


Replication 1 Replication 2 
(6) (4) (5) (8) (4) (3) 
(3) (1) (2) (6) (2) (7) 
(9) (7) (8) (1) (9) (5) 
Replication 3 Replication 4 
(4) (2) (9) (7) а) (4) 
(3) (7) (5) (9) (3) (6) 
(8) (6) а) (8) (2) (5) 


There аге k? = 9 treatments arranged in 3 x 3 squares. A minimum of 
k + 1 = 4 replications is required for double balance. To increase the 
number of observations on each treatment, a basic balanced plan may be 
repeated n times. In Table 9.4-1, each treatment is paired within the same 
row with every other treatment once. Further, each treatment is paired 
within the same column with every other treatment once. For example, the 
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following pairs, which include treatment 1, appear in the columns indicated : 


(1,4) and (1,7) Col. 2 of replication 1 
(1,6) and (1,8) Col. 1 of replication 2 
(1,5) and (1,9) Col. 3 of replication 3 
(1,2) and (1,3) Col. 2 of replication 4 


Thus each of the other eight treatments is paired with treatment 1 in the 
same column. Balanced lattice-square designs may be constructed for k 
equal to a prime number or some power of a prime number; that is, k = 2, 
2°, 3, 7, 9, 11, 13. Plans will be found in Cochran and Cox (1957, pp, 
497-506). 

The analysis of variance for n repetitions of а basic balanced set is 
generally carried out in two steps. In the first step, the total variation is 
partitioned as follows: 


Source df 
Replications nk +1) —1 
Treatments (unadj) 8-і 
Residual (kè — D(kn + n — 1) 

Total nk(k +1) — 1 


In the second step, the residual variation is partitioned in two different ways. 
The first partition of the residual is as follows: 


Source df 


Residual 

(a) Rows (mp) adj for treat 

(b) Col. (mp) adj for treat and rows 
Intrablock error 


(k? — 1)(kn + n — 1) 
n(k + Ik — 1) 
n(k + 1k — 1) 

(kn = n — Y? — 1) 


The notation (mp) indicates that the rows and columns are considered to be 
nested within replication m in repetition p. Within a single basic balanced 
plan there are k — 1 degrees of freedom for the rows within each of the 
k + 1 replications—hence there аге (k + 1)(k — 1) degrees of freedom for 
rows. For n repetitions of the basic plan, there will be n(k + 1)(k — 1) 
degrees of freedom for rows. 

From this partition one obtains an estimate of variance due to the 
columns, E,, which is free of treatment and row effects. 

The second partition of the residual is made as follows: 


Source df MS 
Residual (kè? — 1)(kn + n — 1) 
(c) Cols. (mp) adj for treat n(k + 1k — 1) 
(d) Rows (mp) adj for treat and cols. n(k + 1k — 1) E, 


Intrablock error (kn—n—1(*—1) E, 


Table 9.4-2 Numerical Example 
Repetition a 


Repetition 5 


Replication 1а 


Replication 1b 


Subject Total Subject Total 
© (4) (5) © @ (5 
la 2 3 7 12 1b 2 s 9 16 
Gi usa) з) (а) (2 
2а 3. 8 3 14 2b 4 8 5 17 
CMON (0) 9) (с) (8 
3a 6 5 3 14 3b 8 7 5 20 
11 16 13 40 14 20 19 53 
——— = шр ыша ee 
Replication 2a Replication 2b 
Subject Total Subject Total 
(8) (0 (3) 68) (4 (9 
4а 2 3 44 9 4b 4 5 6 15 
(б) Q (7) 6) (2) (7) 
5а 3 3 3 9 5b 4 4 5 13 
ЧИО) PNG) COONS) 
6a 8 7/ 5 20 65 9 8 7 24 
13 13 12 38 17 17 18 52 
ео! с 
Replication 3a Replication 3b 
Subject Total Subject Total 
(4) Q о) Gy) (9 
Ta 3 2 y 12 7b 5 4 8 17 
9) () (9 3) (0) (5 
8a 4 2 3 9 8b 8 4 5) 17 
OGM Ta) (8) (9 (1) 
9a 2 3 6 11 9b 2 4 6 12 
9 М1; «| 732 / Sy Plo] 46 
AG. PCI Ere Т.” 
Replication 4a Replication 4b 
Subject Total Subject Total 
M а) (5 а) 
10а 4 it 3 14 10b 4 9 5 18 
(9) () © 2 TG) (6) 
Па 4 4 13 115 6 5 5) 16 
68) Q (5 (a) (2)22- (5) 
12а 2 5 9 12b 3 4 8 15 
11 13 12 36 13 18 18 49 
G, = 146 
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This partition provides an estimate of variance due to row effects which is 
free of treatment and column effects. Both partitions provide the same 
estimate of intrablock error. Since treatment effects are not orthogonal 
to rows and columns, the adjusted variations are not completely additive. 
In terms of the notation in the above partitions of the residual variation, 
there is, however, the following additivity: 


(а) + (b) = (с) + (4). 


An adjusted treatment total in a lattice-square design has the following 
form: 


T = Т, AL + ШМ". 


where L; and M; depend upon row effects and column effects, respectively, 
and 4' and yu’ are weighting factors which depend primarily upon E,, E,, 
and E, Adjusted totals of this kind utilize intra- and interrow and 
column treatment information. The expected values for the error terms 
are given in the discussion of the numerical example. 

Numerical Example of Balanced Lattice Square. The example to be 
discussed will be cast in terms of a repeated-measure design. The rows of 
the square will represent the set of treatments assigned to a subject; the 
columns will represent the order in which the treatments are administered. 
Suppose that the basic data are those given in Table 9.4-2. 

In this table there are k? = 9 treatments arranged in a balanced set of 
3 x 3squares. There аге n = 2 repetitions of the same basic plan. Two 
samples of 12 subjects each are drawn at random from a specified popula- 
tion. The subjects from one sample are assigned at random to the rows of 
repetition a; the subjects from the second sample are assigned at random to 
the rows of repetition b. The criterion scores for each of the subjects are 
indicated. For example, subject 1а has scores of 2, 3, and 7 under the re- 
spective treatments (6), (4), and (5). 

The following notation will be used in the course of the analysis of 
variance: 


Tim) = sum of (k + 1) observations on treatment i in repetition p. " 
jm) = sum of k observations in row j within replication m of repetition p. 
Сопу) = sum of k observations in column j within replication m of repetition 


ті) = d of К? observations in replication т in repetition p. 
G, = sum of k*(k + 1) observations in repetition р. 
G = XG, = sum of all observations in the experiment. | 
Rj) = sum of totals for all rows in repetition p which contain treatment i. 
Су = sum of totals for all columns in repetition p containing treatment i. 


Repl 
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Table 9.4-3 Summary Data for Numerical Example 


Repetition а 


Treatment | Tia Қа) Сау Риу Ша Лао Қаш) М ita) 
1 29 59 58 1 =3 —2 0 1 
2 10 44 46 -2 0 -2 -6 =§ 
3 15 45 45 0 11 11 11 11 
4 12 47 50 42 -6 —9 —15 —18 
5 20 50 53 =3 6 3 -3 -6 
6 12 45 43 2 2 4 8 10 
7 14 46 46 0 4 4 4 4 
8 9 43 46 =3 1 —2 —8 -11 
9 25 59 51 8 --15 —7 9 17 

146 = С, 438 438 0 0 0 0 0 
Repetition b 

Treatment | 7) Ry С Di Liw Лоу Ko Mi 
1 32 71 74 -3 12 9 3 0 
2 17 62 66 -4 3 -1 —9 —13 
3 23 65 65 0 9 9 9 9 
4 20 66 70 -4 -4 -8 —16 —20 
5 29 72 74 2 -1 -3 -7 -9 
6 15 57 61 -4. 17 13 5 1 
7 20 68 63 3 —12 -7 3 8 
8 14 62 64 -2 -6 -8 —12 —14 
9 30 77 63 14 —18 -4 24 38 

200 = G, 600 600 0 0 0 0 0 


The following symbols are defined in terms of those given above: 
Dio) = Rit) — Cys 
49) = KT) — (k + 1)К + Gp 
Jit) = Diy + Li, 
Kio) = Jio + (k — 1) Dig, 
Mic) = Dig) + Кү. 

Summary information for the data in Table 9.4-2 is given in Table 9.4-3. 
The repetition to which the symbols refer is indicated in the headings for 
the parts. Where the Tepetition to which reference is made is clear, the 
subscript indicating the repetition will be dropped. Under repetition a, 


8 from subject 2a 
8 from subject 6a 
6 from subject 9a 
_7 from subject 10a 
1: -- 29 
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The entry А, in repetition a is obtained as follows: 


14 row total in rep 1a containing treat 1 
20 row total in rep 2a containing treat 1 
11 row total in rep 3a containing treat 1 
14 row total in rep 4a containing treat 1 


К, = 59 
The entry С, is obtained as follows: 


16 col total in rep 1а containing treat 1 
13 col total in rep 2a containing treat 1 
16 col total in rep 3a containing treat 1 
13 col total in rep 4a containing treat 1 


C, = 58 


By using different sections of the keyboard оп a calculator, 71, Ri, and C, 
may be obtained in a single summation process. As a partial check on the 
computational work, 


Eno = Gs; ХК = ХС) = kG,. 
It may be shown algebraically that 


Mig) = Da + Ki) 
= KT — (k + 1)Ci + б. 
For example, Mj = 1+0=1 
= 3(29) — 4(58) + 146 = 1 


This dual method for computing M; also provides a numerical check. 

The summary data in Table 9.4-4 may be computed by two different 
methods. The simplest method is to add corresponding entries in repeti- 
tions aand bin Table9.4-3. Equivalently, опе may compute entries іп the 
first three columns directly from the basic data in Table 9.4-2 by summing 
over all repetitions. The other columns are obtained from the first three 
columns. 

Convenient computational symbols are defined in Table 9.4-5. The 
basic data for the computation of symbols (5) through (8) are contained in 
Table 9.4-3. Тһе summations are over all the repetitions. Data for sym- 
bols (2) and (4) are in Table 9.4-2; data for symbol (3) are in Table 9.4-4. 
The first phase of the analysis of variance appears at the bottom of Table 
9.4-5. In reality the residual variation is obtained by subtracting the 
variation due to replications and treatment (unadjusted) from the total 
variation. This residual includes variation due to the columns, rows, and 
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Table 9.4-4 Summary Data for Numerical Example (Continued) 


Repetition а + repetition 5 


Treatment} 7; Ri Ci D; L; Ji K; M; 
1 61 130 132 -2 9 7 3 1 
2 27 106 112 —6 3 —3 —15 -21 
3 38 110 110 0 20 20 20 20 
4 32 113 120 =y —10 -17 —31 —38 
5 49 122 127 -$ 5 0 —10 --15 
6 27 102 104 L2 19 17 13 1 
7 34 114 109 5 38 -3 g 12 
8 23 105 110 235 -5 —10 —20 25 
9 55 136 114 22 —33 =11 33 55 

346 — G 1038 1038 0 0 0 0 0 


Table 9.4-5 Computation of Sums of Squares 


(1) = Сдпк + 1) (5) = ELi) Ikk + 1) 
(2) = EX? (6) = (Kin) lkk — 1) 
(3) = (ETD[n( + 1) (7) = [XM + 1) 
а) = (тері, „к (8) = EU Ikk — 1) 
(1) = (346)2/72 = 1662.72 
(2) = Хх? = 1956 
(3) = (61° + 27° +... + 5528 = 1844.75 
(4) = (40° + 532 +--+ + 36° + 4959 = 1710.44 
0) = [3% + OF D .3 (18908 — 1381 
(6) = [0 + (-6* +--+ +32 + (—9)? +--> + 24954 = 3419 
(D = UF + (—89 + ++ + 08 + (—138 4 os 4. 387/108 = 3248 
(8) =[(—2# + (—2)8 - 4984 (508 4... (—4)/54 = 1552 
Source SS df MS 
Replications (4) - (1) = 4772 7. 
Treatments (unadj) (3) — (1) = 182.03 8 22.75 
Residual (2) — (3) — (4) + (1) = 63.53 56 1.13 
Total (2) — (1) = 293.28 T 


WELL EL Rd eT ET [os on 
intrablock error. Since the treatment effects are not orthogonal to the row 


and column effects, there is not complete additivity for the adjusted varia- 
tion. 


_The Subdivision of the residual is given in Table 9.4-6. Under one sub- 
division, an estimate of the column variation (adjusted for treatments and 


> 


rounding error the entries should be identical. The expected values of the 
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corresponding mean squares are as follows: 


k—1 
dc des 
n 


nk 1 
n 


E(E) = 02 + 


2 
Ocolumns; 


EŒ) = o? + 


E(E,) = o. 
The weighting factors needed to obtain adjusted treatment totals are also 
defined in Table 9.4-6. In terms of expected values of the mean squares, 


it may be shown that 
1 


EW) = ores 
m) о? + kotows 
1 
E(W,) =, — 
(Wo) о? + kocolumns 
1 
БИ) ==. 
8а 


Adjusted treatment totals have the general form 
T; = T; + JL, + М, 
where 4' and. и’ are defined in Table 9.4-6. Тһе effective error associated 
with differences between adjusted totals is 
E; = E,[1 + k(’ + ш) = .647[1 + 3(0429 + .0846)] 
= .894. 

Adjusted treatment totals and means are summarized in Table 9.4-7. 
These adjusted totals combine interrow and intercolumn treatment 
information with the intrarow and intracolumn treatment information. - 

A test on the hypothesis that 7; = тз, for example, uses the statistic 

_ (y = T) _ (6147 — 40.55) 
~ 2n(k + 1)Е, 4(4)(.894) 
= 30.60. 
A critical value, for significance level .05 under the Scheffé procedure, is 
ВЕ (8,24) = 18.9. In summarizing the information on adjusted means, 


any of the test procedures described in Sec. 3.7 may be used. қ 
An approximate over-all test оп the hypothesis of no treatment dif- 


ferences is given by 
MSp 
E; 
Ст) /пск + 9] — (0) 
4 


F= 


where MSp- 


орар — TEDE Lre _ 


SERT bed 7x 
PA = 3) "ur иу E Ca D+ ^u + tay _ 3 
m- ^n $ M-m у 
П = 896° = 
(1%9') — (Ero _ (Lr9) — (696)9 _ 
$ F © E 
"au tae: 7 ааш. EE Se 
sue A Legeg M 9ST = == M 


"d= [9 vc 6:61 = (8) — (4) — 
(1) + Ф) - (O — (0) 10119 ОЕШ 
“A = 696 9I 2651- (8) (109 pu? 1721) 20у fpe) дәл “м smoy 
9I SCE = (1) (1241 10) Гре) dəx >м "јод 
95 Є<`Є9 enpisay 
"A= д9 vc #561 = (9) — (6) — 
(1) + Ф) — (є)— @ 0119 xoo[qeaju] 
94 = LETZ 9r 6Г?Е = (9) (Smo. pue вәд 10) Гре) дәл см "ооу 
9I IS'£I = (с) Geaa) 103 Гре) дәл “м smoy 
95 €s'€9 = (D + (р) — ©) — (2) Tenpisoy 


SW Jp SS eoinog 
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In this case, 
1877.02 — 1662.72 214.30 
8 


MS, 26.79, 


and F= 00098 29.97. 


71894 


For a .05-level test, the critical value is F,,(8,24) = 2.36. 
Table 9.4-7 Adjusted Treatment Totals and Means 


Treatment | T; = T; + (.0429)L; + (.0846)M; Ti Т, 
1 61.47 73 7.6 
Р) 25.35 2.9 3.4 
3 40.55 5.1 4.8 
4 28.36 3.5 4.0 
5 47.95 6.0 6.1 
6 28.74 3.6 3.4 
7 34.67 43 42 
8 20.67 2.6 2.9 
9 58.24 а 6.9 

G = 346.00 А 43.0 43.3 
біті + 1) = 43.2 


9.5 Balanced Incomplete-block Designs 


The balanced lattice designs considered in Sec. 9.3 are special cases of the 
general class of balanced incomplete-block designs. In the general case 
the following notation is used: 


number of treatments. 

— block size. 

number of blocks. 

number of replications. xi 

number of times each pair of treatments occurs together within some block. 


LY егш 
Vd gp god 


A necessary condition for a balanced design is that each possible pair of 
treatments occurs together within some block the same number of times; 
that is, 2 is a constant for all pairs. | . 

The design given in Table 9.5-1 is balanced in this sense. It will be noted 
that treatment 1 is paired with treatment 2in blocks 1,2,and6. It willalso 
be noted that treatment 1 is paired with every other treatment in three blocks. 
Hence 4 = 3 for treatment 1. Similarly, 2 = 3 for all the other treatments. 
Inspection of this design will show that each of the treatments occurs in 6 
of the 10 blocks; hence there are six replications. In this particular design, 
the blocks cannot be grouped into distinct replications. However, blocks 
1 through 5 include three replications, and blocks 6 through 10 include 
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three replications. Insome balanced incomplete-block designs it is possible 
to group the blocks into distinct replications. In the design given in Table 
9.5-1,b = 10. A question might be raised about the minimum number of 
blocks required for balance, assuming / and k specified. 


Table 9.5-1 Balanced Incomplete-block Design 
(t =5,k =3, b = 10,r = 6, 2 = 3) 


Block Treatments Block Treatments 


(1) 17273 (6) D 
Q) 1. 2 5$ (7) 13 4 
(3) 1:4 5 (8) gs XE) 
(4) 2^3 4 (9) 23 5 
(5) 345 (10) 2 4 5 


The total number of observations in an experiment having b blocks, with 
К observations in each block, is bk. If there аге ғ replications of / treat- 
ments, then the total number of observations is rt. Thus, the following 
relation must hold: 


bk — rt. 
From this relation, 
ban, 
k 
If Kk —3andt = 5, 
PED 
3 


Since b must be a whole number, r = 3 is the smallest number which will 
make bequal toa whole number. Withr =3andb = 5, it is not possible 
to construct a balanced design for the specified k and t in five blocks. The 
next larger value of г that will make b a whole number is 6. Whenr = 6 
and b = 10, it is possible to construct a balanced design having 10 blocks; 
further, this is the minimum number of blocks for which a balanced design 
(having t = 5 and k = 3) may be constructed. The minimum block size 
for other combinations of г and К is given in Table 9.5-2. Balanced in- 
complete-block designs are available for these combinations of ¢ and k with 
the minimum P indicated. A more extensive table of available balanced 
incomplete-block designs will be found in Cochran and Cox (1957, pp. 
469-470). 

With block size k, treatment i may be paired with k — 1 other treatments 
within a single block. If there аге ғ replications, treatment i will appear in 
r different blocks. Hence the total number of pairs in which treatment i 
appears within some common block will be r(k — 1). For example, in the 
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design in Table 9.5-1 treatment 1 occurs in two pairs with each of blocks 1, 
2, 3, 6, 7, and 8. Hence the total number of pairs occurring within some 
common block which include treatment 1 is 6(2) = 12. 

In a balanced design each treatment is paired within some common block 
with the other г — 1 treatment å times. Hence the total number of pairs 
occurring within some common block must be equal to A(t — 1) if the design 


Table 9.5-2 Some Available Balanced Incomplete-block Designs 
_ о ERO ER o Te ара 


t k r b E 
= 2 3 6 .67 
5 2 4 10 62 
5 3 6 10 83 
6 2 5 15 .60 
6 3 5 10 .80 
6 4 10 15 .90 
y 2 6 21 58 
7 3 3 7 178 
7 4 4 7 88 
8 2 7 28 57 
8 4 7 14 86 
9 2 8 36 .56 
9 3 4 12 5 
9 4 8 18 84 
9 5 10 18 90 
10 2 9 45 56 
10 3 9 30 74 
10 4 6 15 83 
10 5 9 18 89 
10 6 9 15 93 


is to be balanced. In general, for a balanced incomplete-block design the 
following relation must hold: 


r(k — 1) = At — 1). 


By definition, the efficiency factor, designated by the symbol Е, is 


Eom 1) 
E aD: 


It will be found that the number of effective replications for within-block 
treatment information will be equal to rE. Ву using the necessary relation- 
ships for a balanced design, through straightforward algebraic manipula- 


tion it may be shown that 
krE=th=kr—r+. 
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The analysis of variance for the case in which the replication dimension is 
disregarded has the following form: 


Source df | MS 
Treatments (unadj) f 
Blocks (adj) 5-1 Е, 
Intrablock error rt —t—b +1 E, 
Total ТЕЗІ | 


Before indicating the method for obtaining the sums of squares іп the anal- 
ysis of variance, some of the principles underlying the adjustments will be 
indicated. These principles are identical to those used in balanced lattice 
designs. 

Assuming a linear model in which block effects are strictly additive, the 
sum of the r observations on treatment i provides an estimate of the param- 
eters on the right-hand side of the following expression: 


T; = ти + тт, + УВ, 
(i) 


where the last symbol on the right-hand side is the sum of the block effects 
in which treatment i appears. Under the same model, the sum of the block 
totals in which treatment i appears estimates the parameters on the right- 
hand side of the following expression: 


Boy = kru + rr; + AEn, + КО, 
(i) 


where 7; represents any treatment exceptz, Since Ут, = 0, it follows that 
тт; + AEn, = (г — A)r; + ir; + AEn 
= (r — A. 
This is so because 47; + Ул, = т; = 0. Hence, 
B) = kru + (т — Ar, + КУВ,. 
m 
1 In terms of T; and Bu), an unbiased estimate of the effect of treatment i is 
given by 
КТ, == kru + krr; + кув, 
(с 
Bo = kru + (r — A)r; + КУ, 
[3] 
Q; = kT, — By = (kr — r + ғу 
= КғЕт, 


If there were no confounding with block effects, the right-hand side of this 
last expression would be krz,. The efficiency factor E, which is always less 
than or equal to unity, is a measure of the relative amount of within-block 


BALANCED INCOMPLETE-BLOCK DESIGNS 481 


treatment information per replication. A balanced incomplete-block 
design is more precise than a randomized- (complete-)block design only if 


02 < Eo’, 
where c? is the error variance for the incomplete-block design and о? is the 
error variance for the randomized-block design having an equal number of 
replications. 
The sums of squares required in the analysis of variance are obtained as 
follows: 


G 
SStotat = Ух? ар) 
rt 
De С 
SStreat(unadj) ------- > 
r rt 
xo? 
SStreat(adj tor blocks) = 20 4 
IB б 
SSpiocks(unadj) = —— — —7. 
ocks(unadj) k E > 


SSpiocks(aaj) = SSpiocks(unadj) + SStreat(ad] tor blocks) — SStreatqunadj); 
SSintrablock error = SStotal — SStreat(unadj) — SSpiocks(ad i) 
An estimate of a treatment total, which combines intrablock and inter- 
block information, is given by 
т = T; + Wi 
where 
(b — 1XE, = E.) 

(К — 1)(b — DE, + (t — Ю(Ъ DE, 
Bs W, = (t — ЮТ, + (t — 1)В + (k — DG. 


(If E, is less than E,, then the value of x is taken to be zero.) It should be 
noted that the definition of W; in this design differs from the corresponding 


definition for a lattice design. Wi in this case is equivalent to k — 1 times 
W, for the balanced lattice. The effective error for adjusted treatment 


totals is 


dum 


E; — Е + (t — kyu]. 
A test on the difference between two adjusted treatment totals uses the 
Statistic 
GY 
2rE; 

If means are used instead of treatment totals, the F ratio is 

cu um 

2(E;/r) 
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For a .05-level test the critical value is F,,[1, (rt — t — b + 1). Any of 
the methods for making multiple comparisons discussed in Chap. 3 may be 
adapted for use in this analysis. 

The over-all variation due to the adjusted treatment effects is 
UT c 

r rt 
This source of variation has / — 1 degrees of freedom. An approximate 
over-all test on differences between treatment effects is given by 

Re MStreat(aas) b 
Е, 
If only the intrablock information on treatments is used, a test оп dif- 
ferences between treatment effects is given by 
F= MStreat(adj tor blocks) 
E, 

The latter F ratio will follow an F distribution when the usual analysis of 
variance assumptions are met. However, the F ratio having E; as a de- 
nominator does not quite follow the usual F distribution, since no account 
is taken of the sampling error in the estimation of the weighting factor p. 
The sampling error in и does not have a marked effect upon the distribution 
of the F ratio when the number of blocks is greater than 10. 

For designs in which, simultaneously, К is less than 5 and b is less than 10, 


interblock information on treatmentsis generally disregarded. Іп this case, 
the adjusted treatment total is given by 


SStreatiaaj) = 


„_ 0; 
T = = 5 
dern + rG 
An adjusted treatment mean is given by 
тете 
КЕ 2 


Thus 7; is a treatment mean based solely upon intrablock information, 
whereas Т; is a treatment mean which combines both intrablock and inter- 
block treatment information. A test on the difference between two treat- 
ment means which is based only on intrablock information uses the 
following test statistic, 

(еШ 


2KEJE) ' 
where E = (К — 1)/k(t — 1). 

The adjusted sum of squares for blocks may be computed by an alter- 
native method. As a first step one computes the treatment component of 
the blocks (ignoring treatments), which is given by 

ss, = 250 — [CB] 
k(r — 2) é 


BALANCED INCOMPLETE-BLOCK DESIGNS 483 


What is called the remainder sum of squares is given by 
SSrem = SSpiocks — SS}. 
The remainder sum of squares has Р — t degrees of freedom. The treat- 
ment component of the blocks (adjusted for treatments) is given by 
xw 
=== 
ri(t — kk — 1) 
This source of variation has t — 1 degrees of freedom. The adjusted sum 
of squares for blocks is given by 
SSpiocxs(aaj) = SSrem + 55, 
with degrees of freedom 
b—1-(b-—1)-4(t— 1). 

In this latter form, the adjusted sum of squares for blocks is seen to have 
two components. The expected value of the mean squares due to the com- 
ponents as well as the pooled sources are as follows: 

E(MSrem) Ix: оз gn Коў, 
E(MS,) = о? + КЕоў, 
‚ bk 1 02 

ь-1” 

Designs Arranged іп Distinct Replications. Some of the balanced 
incomplete-block designs may be arranged in distinct replications. The 
analysis of variance for such designs, provided that the experiment is con- 
ducted as a replicated experiment, has the following form: 


E(MSptocks(aa) = 0%: 


Source df MS 
Replications r-i 
Treatments (unadj) t-i 

Blocks within reps (adj) b-r Е, 

Intrablock error rt—t—bl E, 
Total "= 


oe OE ЕЕЕ = ———— 
The analysis of this design is identical to that which has just been outlined, 
except for two changes introduced by the fact that the blocks are nested 
within the replications. 
The variation due to replications is given by 
2 2 
SSreps = ER, > a , 
t rt 


where R,, is the sum of the ¢ observations in replication m. Since the 


blocks are nested, SSpiocks within reps Feplaces SS, 4. throughout. 
ZB} XR 


following relation: 


SSpiocksw. Teps(adj) = SSpiocks w. reps ар SStreat(aaj) EE SStreat(unaaj). 


Computation of the treatment variation is identical to that for the case in 
which there are no distinct replications. 

The definition of и for this type of design is different from that given pre- 
viously. In this case, 


КЕ, — E) 
ri(k — 1)E, + k(b — r — t + DE, 


The rest of the analysis has the same pattern as the design which has just 
been considered. 

Repetitions of Designs Not Having Distinct Replications. A balanced 
incomplete-block design will provide r observations on each treatment. If 
itis desiredtohave nr observations rather than r, the design may be repeated 
n times, with independent randomizations each time. It is convenient to 
let 5 = b'n, where b’ is the number of blocks in the original design. It will 
also beconvenient tolet r = r'n, where r’ is the number of replications in the 
original design. Thus, in this context, Pis the number of blocks in the en- 
larged plan, and r is the number of replications in the enlarged plan. In 
terms of this notation, the analysis of variance has the following form: 


a ОВ Б ш Шолу e 
Source df MS 


Repetitions л.—1 

Treatments (unadj) t—1 

Blocks within repetition (adj) b—n E, 

Intrablock error rt—t—b+1 E 
Total nrt —1 


a ee eee 


The sums of squares are obtained as follows: 


е 


(i) = SStom = Zx? — Z. 


^ 2 2 
(ii) = SSrepetitions = керу = = 5 
rt/n rt 
tee 2 ° 
(iii) = SStreat(unaaj) = = = e А 
г ті 


ы 2 

(iv) = SStreat(aaj tor blocks) = 20. y 
^ p2 2 

(v) = SSpiocks w. repetitions Tr == (Кер)? 5 
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The adjusted variation due to blocks within replication is given by the | 
rt[n 
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The adjusted sum of squares for blocks within repetitions is given by 
(vi) = SSpiocks w. repetitions(aaj) = (iv) + (v) — (iii). 
The intrablock error is 


SSintrablock error = (i) — (ii) — (iii) — (vi). 
For this case, 
(b — nXE, — Е,) 
(К — 1(b — n)E, + (t — EK(b —t—n4- 1)Е, 


(It will be noted that whenn = 1 this definition of p reduces to that given for 
the first design in this section.) The procedure for obtaining adjusted 
treatment totals and making tests is analogous to that followed for a single 
run of the basic balanced design. 

If the block corresponds to a person, this type of design calls for п groups 
of subjects with 5' subjects per group. The subjects within a group are 
assigned at random to the blocks defined by the basic plan. Each subject is 
given the К treatments within the block to which he is assigned; order of 
administration is randomized independently for each subject. In terms of 
a repeated-measure design with the subjects corresponding to blocks, the 
analysis of variance has the following form: 


“ 


Source df MS 
Groups nie 1 
Treatments (unadj) 1-1 
Subj w. groups (adj) b-n E, 
Within-subject error th—t—6b+1 E, 


If E, is large relative to E,, only within-block information on treatments is 
required. If between-block treatment information is disregarded, the 
analysis of variance takes the following form: 


Source df MS 
Groups n=1 
Subj w. groups (unadj) b-n 
Treatments (adj for blocks) 1—1 
Within-block error i= h—b +1 Р, 


The adjusted sum of squares for treatments is given by 


Egi 


SStreat(aaj for blocks) = rE. 


It is readily verified that 


SSsubj w. group(unadj) + SStreat(adj for blocks) = SSsubj w. атош(ай) + SStreat(unaaj). 
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When between-block treatment information is disregarded, E, is the proper 
error mean square in statistical tests. In testing differences between means, 
each mean is considered as being based upon rE replications rather than r 


replications. Thus the standard error of a difference is V E,f(rE) instead of 


VE, |r. 
9.6 Numerical Example of Balanced Incomplete-block Design 


Computational procedures will be illustrated by means of the data in 
Table 9.6-1. In the experiment from which these data were obtained, there 
are t = 6 treatments arranged in blocks of size k = 3. A balanced design 


Table 9.6-1 Numerical Example 
(t =6, k =3,r =5,b = 10, 4 =2, Е =.80) 


Subject Total | Subject Total 

0 Oo © o о ol 

1 2 3 8 13 6 3 6 5 14 
0 0) (9 0) GO o 

2 5 4 210 19 EA 4 7 10 21 
(0 (9 @ о) а) © 

3 4 8 8 20 8 7. 7 11 25 
а) (4) © GO © © 

4 LIST TA 28 9 9 12 11 32 
а) (0 © © © © 

5 6 9 13 28 10 9 14 13 36 


Wi = ЗТ, — 5B + 2G |Т! = Т, + .078W, 


1 —36 4 24.3 
2 —29 75 26.8 
3 2 14 40.1 
4 ~9 —29 35.7 
5 4 =7 56.5 
$ 31 —57 52.6 


236.0 


Tequires a minimum of 10 blocks, This design corresponds to plan 11.4 in 


Cochran and Cox (1957, p.471) There are r = 5 replications, but the 
blocks cannot be arranged in distinct Teplications. Each treatment is 
paned with each of the other treatments within a common block 4 = 2 
imes. 

| Subjects are assigned at random to the blocks. For example, subject 6 
is assigned to the block having treatments 2, 3,ап44. The criterion scores 
on subject 6 are, respectively, 3, 6, and 5. The sequences in which the 
treatments are given the subjects are randomized independently. In this 
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case there are k = 3 measures on each subject, the measures being the 
criterion scores on the treatments assigned to the blocks. The sum of the 
criterion scores for an individual jis B;. For example, 


B,—3--6--5—14. 


Sums required in the analysis of variance are given in the lower half of 
Table 9.6-1. An entry in column Т) represents the sum of the ғ = 5 
criterion scores on treatment i. For example, Т; is obtained as follows: 


Criterion 

score 
8 from subject 3 
9 from subject 4 
6 from subject 6 
7 from subject 7 
9 from subject 9 

39 = T, 


An entry in column B,, is the sum of the r = 5 totals for blocks in which 
treatment і appears. For example, Bis) is obtained as follows: 


В, 

20 from subject 3 

28 from subject 4 

14 from subject 6 

21 from subject 7 

32 from subject 9 
115 = Bi 


The formulas for Q; and Ж, are given at the top of their respective 
columns. For example, 
0» = 3(39) — 115 = 2, 
И, = 3(39) — 5(115) + 2(236) = 14. 


The general formula for 7; is 
Т = T, + yW, 

Hence the value of must be obtained before Т) can be computed. The 
value of is obtained after the preliminary analysis is completed. 

Convenient computational symbols are summarized in the upper part of 
Table 9.6-2. Symbol (2) is obtained from the individual criterion scores; 
data for all other symbols are in the lower half of Table 9.6-1. A partial 
check on the computational symbols is indicated. This check utilizes the 
fact that the remainder component of the sum of squares for blocks can be 
computed in two different ways. s 

In terms of these computational symbols, formulas for sums of squares in 
the analysis of variance are given in the lower portion of Table 9.6-2. The 
analysis of variance is summarized in Table 9.6-3. The weighting factor и 
and the effective mean square for error E; are also given in Table 9.6-3. 
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Table 9.6-2 Computation of Sums of Squares 


(1) = @(rt) = 1856.53 (4) = (®В?)у/к = 2026.67 

Q-Zx* =2168 (5) -(EQ5/K*rE) = 135.11 
хи? 

(3) = (®Т?уг = 2096.00 (6) - >W; = 55.42 


"(1 — k(k — 1) 


_ ЭВФ — Bold _ 84,982 — 83,544 — 159.78 
0 k(r — 2) 9 
Check: (4) —(0 — (7) = (4) + (5) — (3) — (6) 
10.36 = 10.36 
=——— — l — Иона 
SStotat = (2) — (1) = 311.47 
SStreat(unaay) - (3) — (1) = 239.47 
SSpiocks(unaajy = (4) — (1) = 170.14 
SSpiocks(aaj) = (4) + (5) — (3) = 65.78 


SSintrablock error = (2) — (4) – (5) = 622 
ee 


Table 9.6-3 Summary of Analysis of Variance 


Source SS df MS 
Treatments (unadj) 239.47 5 
Blocks (adj) 65.78 9 7.31 = E, 
е еггог 6.22 15 42 = Е, 
Total 311.47 29 
SS NN 
he (b — (Е, — E) 9(7.31 — .42) 
(Baha ee 2) 1 
(К — 00 — DE, + (t — ЮФ — DE, 60)9)730 + (3)(4)(.42) 


= 078 
E, = Ell + (t — Юи] = .42[1 + 3(.078)] = .52 


Table 9.6-4 Adjusted Treatment Means 


Treatment TET 5-5 Q: 
i ilr 7; ЕРЕДІ G 

1 4.86 4.87 

2 5.36 5.45 

3 8.02 8.04 

4 7.14 7.12 

5 11.30 11.29 

6 10.52 10.45 

47.20 47.22 


1G = 6(7.87) = 47.22 
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Given the value of и, entries in the column headed 7; in Table 9.6-1 may now 
be computed. The latter entries are the adjusted treatment totals for the 
combined intrablock and interblock data. 

Adjusted treatment means are summarized in Table 9.6-4. Two sets of 
means are given. 77 includes intra- as well as interblock information, 
whereas T7 is based solely оп intrablock information. In this case the two 
sets of means differ only in the second decimal place. Whenever E, is 
large relative to E, (in this case E,/E, = 7.31/.42 = 17.4), the contribution 
of the interblock information relative to the intrablock information will be 
relatively small. Thus, when E, is large relative to E,, 


T = T 
The over-all variation between the adjusted treatment totals is 
xn» c 
SStreat(aajy = ORES 
ТЕ rt 


= 2030.05 — 1856.53 = 173.52. 


The corresponding mean square is 173.52/(t — 1) = 34.70. An approxi- 
mate test of the hypothesis that there are no differences between the treat- 
ments, that is, c? = 0, uses the statistic 


_ MStreat(aay 3470 _ 66 73, 
E, 52 

The critica] value for a .01-level test is F (5,15) = 4.56. The test indicates 

that the treatment effects do differ at the .01 level of significance. З 
A test of the hypothesis that 74 = та, against a two-tailed alternative 

hypothesis, uses the following statistic (these two treatment effects are used 

for illustrative purposes): 

(Ty — Тү) _ 40.1— Bom 4 

2ГЕ; 2(5)(0.52) 


If this comparison may be considered as being in the а priori category, the 
critical value for a .01-еуе! test is Fg9(1,15) = 8.68. If this comparison is 
one among a relatively large number of comparisons that are to be made, 
and if an « level for the whole collection of tests is desired, the critical value 
(using the Scheffé approach) is 

(t — IF opl(t — 1), 9] = SFoo[5.15] = 504.56) = 22.80. 


The Newman-Keuls procedure for testing differences between ordered 
pairs of adjusted means is illustrated in Table 9.6-5. (In making these tests 
it is more convenient to work with the adjusted totals rather than the ad- 
justed means.) In this table, mis the number of steps two means are apart. 

When only the intrablock information is used in estimating treatment 
effects, E,/Ereplaces E; in making comparisons. Since the efficiency factor 


F 
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E is always less than unity, E,/E will be greater than E,. In the numerical 
example under consideration, 


E,/E = .42/.80 = .54. 


The smaller the efficiency factor, the greater is the difference between E, 
and E,/E, which is the effective error for comparisons for tests which use 
only intrablock information. 


Table 9.6-5 Tests on Differences between All Pairs of Treatments (Newman-Keuls) 
лш ee eee eames ad g haaa 


а) Q @ (5) © (9 
Ordered 7); 24:3:926:8.135.7.-.40.1.— 52.5 — 56.5 


о 4 G © (05) 
(1) |72,5 114% 158% 282% 322* 


Differences between all pairs (2) 8.9* 13.3* 25.7* 29.7* 
(4) 44 16.8* 20.8* 
(3) 12.4* 16.4* 
(6) 4.0 


$5 = МЕ; = У5(52) = 1.61 
q.s m,15) 4.17 4.83 525 5.56 5.80 
(1.61)q,99(m,15) 6.71 7.78 8.45 8.95 9.34 Critical 
values 


——M Á——M——— LO ÓÓÓ e — PM ———— 


It is of interest to point out some of the relationships that exist between 
the sums of squares in a balanced incomplete-block design. For example, 


SSpiocks(unaay) + SStreat(ad] tor blocks) = SSpiocks(aay) + SStreat(unadj), 
170.14 + 135.11 = 65.78 +- 239.47, 
305.25 — 305.25. 


SSrem == SSpiocks(unaay a SS, 
— 170.14 — 159.78 


Also, 


= 10.36, 
SSrem — SSpiocks(aaj) 33 SS, 
= 65.78 — 55.42 
= 10.36. 
In the above expressions, 
Zw? 


SSmo 
на Kk — 1)’ 


ss, — 280 — [Bor] 
k(r — 2) i 
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Estimation of Missing Data. One method of estimating a missing cell 
entry will be illustrated by means of the numerical example. The principle 
underlying this method of estimation is that of minimizing the intrablock 
error in the subsequent analysis of variance. Suppose, in Table 9.6-1, that 
the entry under treatment 6 for subject 4 were missing. If an x were sub- 
stituted for the missing entry, the totals in the lower part of this table would 
have the following form: 


Treatment T, Buy 0; 
1 24 96 +x -24-х 
2 21 92 -29 
3 39 103 + x 4-х 
4 38 123 -9 
Ы 57 140 41 
6 45+x | 128+х 7+ 2x 


If B, = sum of observations present in block with missing entry, 

О, = О value for treatment which has missing entry, ү 

Q; = sum of О values for treatments in same block as missing entry, 
then an estimate of the missing entry is given by 

T rt(k — 1)B, + k(t — 1)Q, — (t — DO; 
а (k— (к 1) — k(r — 1)] 

For the case being considered, the missing cell entry may be designated 
Хо that is, an observation on treatment 6 in block 4. 

B, =7+9 —16, the observations in block 4, 

0, =7, disregarding x, 

Q’ = Q, + Q; = —24 + 14 = —10 disregarding x. 
Q' has the value indicated since treatments 1 and 3 are present in the block 
with the missing cell entry. Thus 

x, = 302016) + (3)(5)(7) — G)YC— 10) 
en UU 2[(30)(2) — 3(5)] 
„ШЗ 2124. 
90 

Estimation procedures for the case of more than a single missing entry are 
illustrated in Cochran and Cox (1957, рр. 451-452). For each missing 
entry estimated one degree of freedom is subtracted from the intrablock 
error. Once the missing entries are estimated, the analysis proceeds in the 


usual way. я и 4 É 
In well-planned experimental work, particularly in cases in which a sub- 


ject corresponds to a block, the possibility of missing data can be avoided 
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by having reserve experimental units available for use should the need arise. 
In many cases such needs do arise. Estimation procedures for missing cell 
entries are at the very best only poor substitutes for the actual observations. 


9.7 Youden Squares 


In a balanced lattice-square design there is balance with respect to both 
rowandcolumn. Ananalogous balance applied to a balanced incomplete- 
block design is achieved in designs known as Youden squares. Although 
the shape of a Youden square is a rectangle, the name stems from the orig- 
inal method of its construction—namely, by deleting certain columns of 
special kinds (cyclic) of Latin squares. W. J. Youden developed and made 
extensive use of such designs in agricultural experimentation. Some 
interesting applications of Youden squares to psychological research are 
discussed by Durbin (1951). 

An example of a Youden square is given in Table9.7-1. With respect to 


Table 9.7-1 Youden Square 
((=b =7,k =r =3,4 = 1, E =.78) 


Column 
Block 

1 2 3 
1 4 6 T 
2; 5 "i 1 
3 Е Пр) 
4 ТІ 293 
5 Т БЕГЕ 
6 2 4 5 
if 3 5 6 


| 


the blocks (rows), this design has all the properties ofa balanced incomplete- 

block design. In addition, there is balance with respect to the columns— 

each treatment appears once and only once within each of the columns. 

Hence each column of a Youden square forms a distinct replication. It will 

ss found, however, that the blocks cannot be grouped into distinct replica- 
ions. 

In general, for a Youden square, t = b, andk = ғ. In cases in which the 
block represents a subject in a repeated-measure design, the columns may 
represent the order in which the treatments are administered to a subject. 
The analysis of variance for a Youden square follows the same general 
pattern as that given for the case of a balanced incomplete-block design in 
which the blocks cannot be arranged into distinct replications. Because 
t = b, the analysis becomes greatly simplified. 
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The analysis of variance for a Youden square has the following form: 


Source df MS 
Columns k-1 
Treatments (unadj) 1-1 

Blocks (adj) t—1 E, 

Intrablock error (k – 2)(t —2) E, 
kt —1 


The sums of squares are computed as follows (symbols have the same 
definitions as those used for the balanced incomplete-block design): 


G 


SStotar = ХХ — m 
X(col total? G? 
SScotumns = Ecol. total): mE 
1 kt 
STP OG 
SStreat(unads ITA Ey IE je , 
xw? 


SS ----------. 
blocks(adj) rit E: kk = 1) 


SSintrablock error — SSiotal A 55со1 т, SStreat(unadj) oo SSpiocks(ads), 


Because b = t, there is only one component, the treatment component, of 
the adjusted sum of squares for blocks; hence the simplified computation of 
SSyrocks(aajy aS compared with the general case of a balanced incomplete- 
block design. АЧ, 

The weighting factor for a Youden square simplifies to 


N23 
"E (к= DE, 


An adjusted treatment total has the form 


T; = T, + ШИ. 
A test on the difference between two treatment effects has the general form 
(ie 1%): 
Fost, 
2КЕ, 
where E; = [1 + (t — y]. 


In the analysis outlined above, both intrablock and interblock treatment 
information are used. If only intrablock information on treatments is 
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used, the analysis of variance has the following form: 


Source MS 
Columns 
Blocks (unadj) 
Treatments (adj for blocks) - 
Intrablock error (k — 2)(t — 2) E; 


kt —1 


The numerical value of E, will be found to be the same in both analyses. 
The sum of squares for treatments adjusted for blocks is 


i 


SStreat(aaj for blocks) = TE. . 


A treatment total, adjusted for blocks, is 
„__ Qi 
Т = = + Кб. 
Еч 


In this case а test on the difference between two treatment effects has the 
general form 
PAES Ti? ) 
2kE” 
where E; = E,/E. 
Construction of Youden Squares. Cyclic Latin squares are defined in 
Sec. 10.2. The following 5 x 5 Latin square represents a cyclic Latin 
square: 


a b c d e 
1 2103 ea 5.5 
ады ыыы, S 1 
3.24.5 1 2 
445 1 zw 
5 1 204.3. 4 


To construct a Youden square in which t = 5 and k = 4, any four of the 
five columns of this square will provide the required design. 
The following 7 x 7 Latin square also represents a cyclic square: 


a b С. портер: g 
T Р 
DOS Д ааа ТЕ А 1 
Зе doa So 6n 1850: 
4» қаб 7-6 M 4.42, 3 
25-46. dnd ВЕ: аи 
Е б 3 dis TR 
MUREST хааш Мб 6 
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A Youden square in which / = 7 and k = 3 may be constructed from this 
square by selecting any two adjacent columns and a third column which is 
one step from being adjacent. For example, columns a, b, and d meet these 
requirements. Columns с, е, and f also meet these requirements as do 
columns d, e, апа g. The Youden square given by columns d, e, and g is 
shown below: 


Block | d e g 
1 4.0. E7 
2 J AH 
3 6351—52 
4 m up v 
5 П nf 
6. 2: 34/715 
1 еда 


In using a design constructed in this manner, treatments are assigned at 
random to the numbers, and assignment of the blocks is also randomized. 
Columns within the blocks are likewise randomized. 

Since Youden squares are special cases of balanced incomplete-block 
designs, they exist only where there is a balanced incomplete-block design 
in which k = rand t = b. 

A partial listing of some of the available Youden squares is given in Table 
9.7-2. A more complete listing, as well as the structure of the corre- 
sponding plans, will be found in Cochran and Cox (1957, pp. 520-544). 


Table 9.7-2 Index to Some Youden Squares 


t=b ker л Е 
5 4 3 94 
6 5 4 96 
7 3 1 78 
7 4 2 88 
8 1 6 98 
m 5 2 88 
11 6 3 92 
13 4 1 81 
13 9 6 95 
15 7 3 92 
15 8 4 94 
16 6 2 89 
16 10 6 96 


Repetitions of a Youden Square. A single Youden square provides k 
observations on each treatment. In order to increase the number of obser- 


vations per treatment from k to nk, a Youden square may be repeated n 
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times. The columns of the Youden square should be randomized inde- 
pendently for each of the repetitions. The analysis of variance has the 
following general form: 


Source df MS 
Columns kn —1 
Treat (unadj) t—1 

Blocks w. reps (adj) n(t — 1) E, 

Intrablock error (t — D(kn —n — 1) E; 
Total nkt — 1 


Computational procedures for this case are illustrated by the numerical 
example given in the next section. 


9.8 Numerical Example of Youden Square 


The data in Table 9.8-1 will be used to illustrate the computational pro- 
cedures for the case of n repetitions of a Youden square. The basic plan 
has! = b = 7 treatments arranged in seven blocks. There are k = r = 3 
treatments in each block, and there are three replications in the basic plan. 
Two repetitions of the basic plan are to be used. The columns have been 
independently randomized. The treatments are enclosed in parentheses. 
Corresponding blocks appear in corresponding positions on the left and 
right. 

Suppose that п = 2 samples of t = 7 subjects each are drawn from а 
population of interest. The subjects in the first sample are assigned at 
random to the blocks in repetition 1; the subjects in the second sample are 
assigned at random to the blocks in repetition 2. In Table 9.8-1, subjects 
1 and 8 are assigned to the same basic block ; similarly, subjects 2 and 9 are 
assigned to the same basic block. Subject 1 is given treatments (4), (6), and 
(T) in this sequence. Subject 8 is given the same treatments, but in the 
sequence (7), (6), and (4). Similarly, subjects 2 and 9 are given the same set 
of treatments, but the sequence of administration differs. 

Suppose that the “treatments” represent seven different packages which 
are to be assigned a rating on a nine-point scale of attractiveness. This 
rating defines the criterion score. For example, subject 1 assigned the 
ratings 2, 4, and 6, respectively, to packages (4), (6), and (7). Тһе sum of 
the three ratings is 12. 

The sum of the k = 3 observations on a subject corresponds to a block 
total. This sum will be designated by the symbol Р„; thus P, is the sum of 
observations on subject 1. These block totals are summarized in the middle 
part of Table 9.8-1. There is a set of block totals for each repetition of the 
basic plan. Corresponding to each block in the basic plan there is also aset 
of n totals. The п = 2 totals for block 1 of the basic plan are 12 and 7. 
The sum of these totals, which is 19, will be designated by B,. Similarly, 
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Table 9.8-1 Data for Numerical Example 
(t =7,k =3,n =2,4=1) 
—————— ee Ee ee eee 


Repetition 1 Repetition 2 
Subject Total Subject Total 
4 (9 Q) gy. (0) 
1 2 (4: 6 |а 8 OMM) 1 1=P, 
ORO. uu (Т) (5) 
2 6 9 3 18 9 5 9 3 17 
© а) Q о) 0 © 
3 1 2 3 6 10 4 3 6 13 
(Т) Е (0) (9) oO Qo (7) 
4 4i 1 4 12 11 1 1 4 6 
а) () @ () QG а) 
5 1 1 2 4 12 7 3 3 13 
о) @ 6) 6 «€ Q 
6 1 1 5 7 13 9 5 3 17 
(3) (5) © 6) (5 (9 
7 2 8 9 19 14 5 3 1 9 
Column 
totals 20 26 32 78 35 26 21 82 
e Rep 1 Rep2 Total 
1 12 =P, 1=P, 19 = B, 
2 18 17 35 
3 6 13 19 
4 12: 6 18 
5 4 13 17 
6 7 17 24 
1 19 =P, 9 = Pu 28 = B, 
78 82 160 
Treatments ti Buy W; = АТ; — 6В + 2G О; = ЗТ, — Buy 
1 17 71 —38 —20 
2 13 61 6 -22 
3 12 63 -10 -27 
4 18 60 32 a 
5 34 87 —66 15 
6 27 66 32 15 
T 39 72 44 45 
160 480 0 0 
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Table 9.8-2 Computational Procedures 


(1) =@Jknt = 609.52 (4) = (2B) /kn = 653.33 
B) =>Х% = 876 (5) = [X(col. total)?]/t = 634.57 
(3) = (ЕТ?)/Кп = 722.00 (6) = (“РЕК = 718.67 
(7) = X(rep)?/kt = 609.90 
„= e Us 29.52 
КУУТ ЕЕ) 25 
a 
(9) = WERT 98.19 
Source SS df MS 
Columns (5) — (1) = 25.05 5 
Treatments (unadj) (3) — (1) = 112.48 6 
Blocks w. rep (adj) (6) — (4) - (7) 
+ (1) + (8) = 94.48 12 7.87 = E, 
Intrablock error (2) + (4) + (7) — (3) 
— (5) — (6) — (8) = 34.47 18 1.92 =E, 
Total (2) — (1) = 266.48 41 
pe E enam. ы ыз. 07 о оо ОНОН NENNEN 
= n(E, — E,) 2(5.95) — 0522 


nt(k — 1)E, + (t — ЮЕ, 7 28(7.87) + 4(1.92) 


E; = EJI + (t — Юи] = 1.921 + 4(.0522)] = 2.32 
э NM nr tmt 


Table 9.8-3 Adjusted Treatment Totals and Means 


(Intra- and interblock data) 
—— SE i es 1 О nnm 


Treatment Ti = T, + .522W, 1i 
1 15.02 2.5 
2 13.31 22 
3 11.48 1.9 
4 19.67 33 
5 30.55 5.1 
6 28.67 4.8 
7 41.30 6.9 
160.00 - С 26.7 = G|kn = С]6 
XT? с? 
SStreat(aajy = = — к ^ 73040 - 609.52 = 120.88 


MStreat(adjy = 8$үеацаар/(Т — 1) = 120.88/6 = 20.15 
MStreat(aaj) _ 20.15 
F= -N m. m 
E; 7723 8.69 


F 99(6,18) = 4.01 
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the sum of the two repetitions of basic block 2, 18 + 17 = 35, will be 
designated By. 

Summary data required for the sums of squares are given in the lower 
part of Table 9.8-1. Ап entry in column 7; is the sum of the observations 
on treatment j. For treatment 1, 


3.5 basic block 2: subjects 2 and 9 
2+3 Бавіс block 3: subjects 3 and 10 
1 +3 basic block 5: subjects 5 and 12 


17 =T, 


An entry in column Ba is the sum of the totals for basic blocks in which 
treatment / appears. For treatment 1, 


35 total for n repetitions of basic block 2 
19 total for n repetitions of basic block 3 
17 total for n repetitions of basic block 5 


71 = Ва 


Convenient computational symbols are defined in the upper part of 
Table 9.8-2. For example, 


jae T ro тар} 

(5) munie 634.57, 

(6) 12: F са 17 +. 9° 718.67, 
(7) = wm = 609,90. 


Ineach case, the divisor is the number of observations in a quantity which is 
squared in the numerator. For example, the quantity 78 in (7) is the sum of 
the 21 observations in repetition 1. aum d 

Computational formulas for the sums of squares are given in the lower 
part of Table 9.8-2. Procedures for obtaining the weighting factor и and 
the effective error mean square E; are also indicated. 

Adjusted treatment totals and means are given in Table 9.8-3. Both 
intrablock and interblock treatment information are included in the adjust- 
ment process. Procedures for making an approximate test on the over-all 
treatment effects are summarized. In this case the observed Fratio exceeds 
the critical value for a .01-level test. Hence the experimental data reject the 
hypothesis that there are no differences between the treatment effects. To 
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Table 9.8-4 Summary of Analysis of Variance 
(Intrablock treatment information only) 


Source | 55 df | MS F 
Columns (5) — (1) = 25.05) 5 
Blocks w. reps (unadj) (6) — (7) = 108.75| 12 
Treat (adj for blocks) (9) = 98.19) 6! 1636 | 8.52 
Intrablock error (2) + (7) — (5) — (6) — (9) = 34.47] 18 1.92 
Total (2) — (1) = 266.46 


E; = EJE = E,[r(k — 1)/k(t — 1)] 
= 192.778 = 2.47 


es U ЗЕ. 


test, for example, the difference between the mean rating for packages 5 and 
7, the test statistic has the form 
je (T; — Т;)? _ (30.55 — 41.30)? 
2knE/, 2(6)(2.32) 
The critical value for a .01-level test is Е (1,18) = 8.29. 
The critical value for the Tukey hsd test on all possible differences between 


pairs of adjusted means for a conservative .01 significance level on the 
entire set of comparisons is 


дайын E = зл [222 _ з 
Кп 6 


Two adjusted means are said to be Statistically different if their difference 
exceeds the value 3.60. With this Overstringent decision rule, package 7 


differs from 1, 2, 3, and 4, but no other differences are statistically signifi- 
cant at a .01-level Tukey hsd test. 


= 4.15. 


Table 9.8-5 Adjusted Treatment Totals and Means 
(Intrablock information only) 


TUE G "0.6 
T nth í tA 
reatment 60 Q 
= 47 + 6.81) З 
1 14.29 2.4 
2 13.43 2.2 
3 11.35 1.9 
4 20.29 3.4 
5 29,29 4.9 
6 29.29 49 
7 42.15 7.0 
160.09 26.7 = С]кп = G/6 
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If only the intrablock treatment information is used in estimating treat- 
ment effects, computational procedures for sums of squares are indicated 
in Table 9.8-4. Computational symbols are those in Table 9.8-2. A test 
on the over-all difference between treatment effects can be made in the 
analysis-of-variance table by means of the statistic 


F= MSireat(adi tor blocks) 
Cl I Be 


e 


Adjusted treatment totals and means for this case are summarized in 
Table 9.8-5. A test on the difference т; — 75, for example, has the form 


-T-T 
2пКЕ" 


In this example, the analyses with or without the use of interblock treat- 
ment information give similar results. 


9.9 Partially Balanced Designs 

All the designs which have been discussed so far in this chapter are 
balanced in the sense that each treatment is paired with every othertreatment 
an equal number of times within a common block; that is, 4 was a constant 
for all treatments. These balanced designs may be considered as special 
cases of a broader class of designs known as partially balanced incomplete 
block designs. In the latter some pairs occur together within a common 
block more often than do others. Corresponding to each treatment are 


two or more classes of associates. и ; 
To illustrate what is meant by a class of associates, consider the following 


design, in which the number of treatments ¢ = 10, the block size k = 4, the 
number of blocks b = 5, and the number of replications r = 2: 


-————— 


Block 
a 1 2 3 4 
b крсу) 5254 7 
с Da a ЕС] 
d en ONES 
e ae 10. 


Note that treatment 1 occurs in the same block with treatments 2 through 7, 
but treatment 1 does not occur in the same block with treatments 8,9, or 10. 
Treatments 2 through 7 are said to be the first associates of treatment 1; 
treatments 8, 9, and 10 are said to be the second associates of treatment 1. 
Alternatively, the first associate class of treatment | is the set of treatments 
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2 through 7, and the second associate class of treatment 1 is the set of treat- 
ments 8, 9, and 10. A partial listing of associate classes for this design is 
given below: 


ee ы. 


Treatment First associates Second associates 
1 2.3.45. 6 7 8,9, 10 
2 1, 3, 4, 5, 8,9 6, 7, 10 
9 2, 4, 5, 7, 8, 10 1, 3,6 
10 3, 4, 6, 7, 8, 9 1; 255 


In general if treatments i and / are first associates, then the following con- 
ditions must hold. 

1. The number of treatments common to the respective first associate 
classes (pi) is a constant for all г andj's. For the design given above this 
constant is pj; = 3. In the above example, the first associate classes of 
treatments 1 and 2 have treatments 4, 5, and 9 in common; the first 
associate classes of 2 and 9 have treatments 4, 5, and 8 in common, 

2. The number of treatments common to the second associate classes is a 
constant for all i’s and J's. In this case, the constant is рь = 1. Inthe 
above example, treatment 7 is common to the second associates of treat- 
ments 1 and 2; treatment 6 is common to second associates of 2 and 9. 

3. The number of treatments common to the first associates of i and the 
second associates of j (or vice versa) is constant. In this case pj, —2. In 
the above example, the first associate class of treatment | and the second 
associate class of treatment 2 have 6 and 7 in common; the first associate 
class of 2 and the second associate class of 9 have 1 and 3 in common. 

If treatments / and j are second associates, then the following conditions 
are required for partial balance: 

1. The number of treatments common to the respective first associates is 
constant for all i’s and Js. Inthe example, treatments 1 and 9, which are 
Second associates, have four treatments in common in their respective first 
associate classes; similarly, treatments 2 and 10 have four treatments in 
common in their first associate classes. Hence, for this case, pu = 4. 

2 Тһе number of treatments common to respective second associate 
classes is constant. In the example, the second associate classes of 1 and 
10 have no treatments in common; similarly the second associate classes of 
1 and 9 have no treatments in common. Іп this case pł, = 0. 

3. The number of treatments common to the first associate class of i 
and the second associate class of J (or vice versa) is constant. In the 
example, рї, = 2. 

In the general case of partially balanced incomplete-block designs, each 
treatment may have л associate classes. Treatments in the first associate 
class of treatment i are paired with treatment ; in A, blocks; treatments in 
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the second associate class of i are paired with i in A, blocks; . . .; treatments 
in the nth associate class of i are paired with iin 2, blocks. Balanced in- 
complete-block designs form that special case of partially balanced designs 
in which there is one associate class for each treatment. The symmetries 
imposed upon the structure of partially balanced designs are what permit 
simplified solution of the normal equations obtained in using the least- 
squares principle in the estimation process. 

Partially balanced designs may be analyzed either with or without the 
recovery of interblock treatment information. Principles underlying the 
analysis will be found in a readable article by Bose and Shimamoto (1952). 
Extensive tables of partially balanced designs have been compiled by Bose 
et al. (1954). Since all treatments are not paired an equal number of times 
within a common block, in comparing pairs of treatments the proper error 
term depends upon the kind of association between the pair in question. 
One of the major disadvantages of partially balanced designs, in contrast to 
completely balanced designs, is the fact that differences between all pairs of 
treatments cannot be estimated with the same degree of precision. How- 
ever, this latter disadvantage can, in part, be turned into an advantage. If 
greater precision is desired for some comparisons, the experimenter may 
assign treatment numbers to the treatments in such a way that pairs on 
which greater precision is desired are those which occur together in the same 
block most frequently. 

Cases of Special Interest. The following partially balanced incomplete- 
block design resembles a Youden square—each of the columns defines a 
replication. In this design, however, each treatment has two associate 


classes. 
pip a NE ышы эе кл ё жыйы 


Block 

a {БА Ас T ES 
b 23 8120132376 
c 27 64772724 
а хал EP 
е жаа Оу 56-23 
gl ped ен 


A partial listing of the associate classes is as follows: 


aa ee ЗИРЕ 
First Second 


Treatment | associates | associates 
1 4 2, 3, 5,6 
2 5 1, 3, 4, 6 
6 3 1.2.5.9. 
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The first associates appear together in 4, = 4 blocks, whereas the second 
associates appear together in 4, = 2 blocks. For example, treatments 2 
and 5, which are first associates, appear together in blocks a, b, d, and е. 
The Youden square is that special case of this design in which all treatments 
have only one associate class. This plan is actually superior to a Youden 
square for those situations in which greater precision is required for first 
associates than for second associates. 

In addition to the two kinds of balanced two-dimensional lattice designs 
that have been discussed, there are also partially balanced two-dimensional 
lattices. For example, there are designs in which К? treatments are ar- 
ranged in blocks of size k with (k + 1)/2 replications. In the latter design, 
each treatment has two associate classes. Lattice-square designs may also 
be constructed with ( + 1)/2 replications. Іп two-dimensional lattice 
designs, a double lattice is one in which the number of replications is a 
multiple of 2; a triple lattice is one in which the number of replications is a 
multiple of 3. Consider the following triple lattice (since? = 9 = 32 = Ke 
the following design may be classified as two-dimensional): 


Replication 1 Replication 2 Replication 3 
Block Block | Block | 
a IM 2514 4 Do x6» G7 g | 1 4 
BATA isa 16 e do 5419 h | 3. 6 
c Пада 5:9 if. Зь o8 i. |2 5 D 


In this design, each treatment has two associate classes. For the first 
associates 2, = 1; for second associates 4, = 0. Each treatment has only 


a single second associate. Hence this triple lattice is close to being 
balanced. 


of size k. This type of design may be related to a k x k x k factorial 
experiment arranged in blocks of size k. The principles underlying the 
analysis of variance for this case are essentially those of the corresponding 
factorialexperiment. A complete replication requires k? blocks. Partially 
balanced designs of this kind require a minimum of three, or some multiple 
of three, replications. The computational procedures for a three-dimen- 
sional triple lattice for k? — 27 treatments in blocks of size k = 3 withr = 3 
replications are given in Federer (1955, pp. 349-355). 

_ Rectangular lattices form a broad category under the partially balanced 
incomplete-block designs. These designs may be considered as two-dimen- 
Sional lattice designs having one restriction, but the number of treatments 
is kp. Of special interest are the So-called rectangular lattices having 
K(k + 1) treatments in blocks of size k. The following design represents 
a triple lattice (i.e., the number of replications is a multiple of 3) having 
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КЕ + 1) = 12 treatments arranged in blocks of size k = 3: 


Replication 1 Replication 2 Replication 3 
Block Block Block 
a 1 2,3 Jh ПИ J 150 Ж.) 
b NONEM б g 2.5, 42 k 2,9% ,10: 
с M e 7: h 2. “ӨТ” 9 1 3. 4. U 
d | 10 111302 i Ж S730 m 6 8 12 


In the above design, treatments which are first associates appear together in 
À, — 1 block; treatments which are second associates appear together in 
4, =0 block. The general procedures for handling partially balanced 
incomplete-block designs may be used for rectangular lattices. However, 
the general computational formulas can be simplified for these special 
designs. 

Designs for k = 2. Of special interest in the behavioral sciences are 
designs having block size К = 2. А block may consist of a pair of twins, a 
husband and wife, the right and left hand of an individual, еіс. А balanced 
design for the case k = 2 requires a minimum of ¢(¢ — 1)/2 blocks. For 
example, a balanced design for the case in which = 10 requires a minimum 
of 45 blocks. If the number of treatments is at all large, the only feasible 
design may be a partially balanced design rather than a balanced design. 

A special class of partially balanced designs for the case in which k = 2 
has been developed by Zoellner and Kempthorne (1954). These designs 
are knownas circulant designs. Although the number of associate classes 
for these designs increases as the number of treatments increases, the com- 
putational work for circulant designs is relatively simple when the design 
constants associated with the reduced normal equations are known, The 
latter constants have been tabulated for cases in which the design corre- 
sponds to specified plans. [ | ) 

A plan for a design in which t = 7 and = 4is specified by the following 
set of numbers: 

лме е (м OF eet 


To determine the treatments which occur іп the same block with treatment 
1, for example, one adds to the digit 1 the numbers 1,2, —, —, 5, 6. The 
latter set of numbers corresponds to the pattern (1 1 0 0 1 |). 
Thus the treatments which appear in the same block with treatment l are 
treatments 2, 3, 6, and 7. To find the treatments appearing in the same 
block with treatment 5, one adds to the digit 5 the same set of numbers, sub- 
tracting г = 7 if the resulting sum exceeds 7. Thus the treatments paired 
with treatment 5 are б, 7, 3, and 4. In general, the treatments paired with 
treatment j are j + 1,j + 2,j + 5, and j + 6, t = 7 being subtracted when 
the sum exceeds 7. 
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A plan for a design in which / = 8 and r = 6 may be specified by the 
following set of numbers: 
Е Т Т И. 
The digits to be added to a treatment number аге 
h 283—526. 7 
Thus the treatments which appear in the same block with treatment jare 
IELI FA )-3, jt 5, 46, 7 +7. 
Ifa sum exceed f = 8, then the corresponding treatment is defined to be the 
sum minus 8. Тһе design constants associated with designs constructed in 


accordance with these plans have been tabulated by Zoellner and Kemp- 
thorne (1954, р. 179). Some of the plans in this series for which the design 


Table 9.9-1 Plans for Circulant Designs for k = 2 
rrr 


1 r Plan 

6 4 "TOt per 

7 4 *pbgt6olg 3 

8 6 Se Т аб 2121 

8 5 бале Ea И ОЗТ. 

8 4 ае 109 T 

8 3 ТОТО ОЖТ 

9 6 МІ Шыны ШЕН 1 
9 4 ОО 1 991 
10 5 Жи Ө: Мы Өчө. І 0-1 
10 4 ПОЛО. 0..1 00 1 


constants have been tabulated are given in Table 9.9-1. The structure of 
the blocks for the design in which t = 6 and r = 4, having the plan 
(ошо; a 1), is as follows: 


(1,2) (1,3) (1,5) (1,6) (2,3) (2,4) (2,6) (3,4) (3,5) (4,5) (4,6) (5,6) 


The efficiency factor for the designs given in Table 9.9-1 is in the range 
from.40 to 60. A numerical example of the computational procedures for 
these designs is given by Zoellner and Kempthorne. 


9.10 Numerical Example of Partially Balanced Design 


Although most of the examples that have been given in this chapter have 
been cast in terms of experiments having repeated measures, it should be 
noted that a block may represent any source of variation that an experi- 
menter may wish to control—a source of experimental material, a time of 
day, an experimenter. The alternative to arranging an experiment in 
blocks is complete randomization. This alternative will serve, in the 
long run, to avoid confounding of treatment effects. However, the varia- 
tion due to the potential blocks becomes part of the experimental error. 
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When the experimental material is arranged in blocks, variation due to 
the latter source is not part of the experimental error. 

The numerical example to be considered in this section is for the case in 
which the number of treatments t = 10, the block size k = 4, and the num- 
ber of replications r = 4. With tables of this design are the design con- 
stants c, = 15, Ca = 5, Лу = l, and A, = 2. The actual structure of the 
design as well as the criterion scores are given in Table 9.10-1. 


Table 9.10-1 Numerical Example of Partially Balanced 
Incomplete-block Design 


(t = 10, k =4,r =4, с, = 1%, с; = ү, А = 1, 22 = 2) 
Block В, 


OTO 9r RO 
1 18: жіз 16, 124) 59 
0 Q (9 00 
21:11:15 22м 121 
о ө (D (9 
3 13 18 10 14 | 55 


(6) (4) © (9 
18 17 


85 


4 15 11 | 61 
@ (1) do9 (9) 

5 72 EEN] 1016 046 
() (0 eo 0 

6 5 12 8 9 34 


à © 9 Qo 
7 14 15 19 12 60 


00 0 8 Q9 
8 15 19 24 18 76 


© ao (0) (1) 
7 15 7 7 


(0 (9 (9 @ 
1015226 MTSE eT |2278 


This design has two associate classes. For example, first associates of 
treatment 1 are 2, 3, 4, 5, 6, and 7; treatment І appears with each of these 
treatments іп д, = 1 block. The second associates of treatment 1 are 8, 9, 
and 10; treatment 1 appears with each of these treatments ind, = 2 blocks. 

The columns of this design form complete replications, but the replica- 
tions aspect will be disregarded in the analysis that follows—assume that the 
treatments within each block arearranged ina randomized sequence. Sum- 
mary data are given in Table 9.10-2. An entry in column Т; is the sum of 
ther = 4 observations on treatment i. Ап entry in column Bi is the sum 
of the totals for blocks in which treatment / appears. An entry in column 
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LLL ML Luc 
Cate cee eae 
mdr rd 2 
hee le i 


Table 9.10-1a Associate Classes for Design in Table 9.10-1 
First 


RO. - 100, 


(0 = MINI + 1425-7, 


О, has the general form 


мо, + 


Citic 
PAETECT IE 


p 


canes 


-——— UT 


ч) ~ -10, 
w = 120, ~ 


|---- 


9, = ат, - 


will be wied to designate the sum of those (0/% which 
amocistes of treatment /, For example, 


209‹=0%+0,+0,+ 0,+ 0, + 0, — —%, 
Om 0 +0,+ 0,+ @ + 0,+0,— —55 


E 


Ты symbol 
correspond to 
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These sums appear in column а in the table. There йв а simpler method for 
obtaining these sums. Suppose that the symbol YQ, represents the sum of 


the 0,» to the treatments which are second avvociates of 
treatment 4 It is shown that 
ро. + 50. + 0-0 


For example, for treatment 1, 
жо rga tam PCM ceo 


Hence, 50. - -(50: +a) 

In this deiga оми поа МУ MM Md МИН at au Aes 
associates, the computation of ХО, involves leis work than dows the 
computation of YO, The former are inthe columa 3 Q,of the table, 


the latter in column а. И will be found that entries in just one of these two 
columns need to be obtained, 
The intrablock estimate of the treatment effect is in general 


"а — My, = c 00, + б 930. 
Vor the design in Table 9.1041, 
4u, = (4 = А, + (A АЖ, 

which becomes, upon clearing fractions, 

TI, = 520, ~ 430. 
Hoth sides may now be divided by 4 to give 

пх, = DO, - 30. 
Омар the relation between 20, and 20. 

йм, = 130, + @ + 30. 
= 40, + 20, 

Entries in column b of Table 9.10-2 are obtained most говду from this Шм 


of adjuvted treatment meam, by wing intrabloxi informations, 
have the form eer | 


Each entry in column b estimates 10У, Hence 
r mecs + С. 
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Computational formulas for the sums of squares are given in Table 
9.10-3. Since each entry in column b of Table 9.10-2 is 180/,, symbol (5) 
is computed as follows: 


ive 
(5) = твор (180000, 
= rho[(—668)(—59) + (—738)(—61) + - - - -+ (1192)(95)] 


= 372.46, 

Table 9.10-3 Computational Formulas for Sums of Squares 
(1) = бене = 8702.50 (3) = (2T%)/r = 9226.50 
(2) = ХХ? = 9826 (4) = (2B})/k = 9370.00 


(5) = 1240, - 37246 


Source 


Blocks (unadj) 
Treatments (adj for blocks) 41.38 10.40* 
Intrablock error (2) = (4) - (5) = 83.54) 21 3.98 = E, 
Total (2) = (1) = 1123.50 39 
Treatments (unadj) — (3) — (1) = 52400 “Ғ,(921) = 2.37 


Ej = (9) - (= %) - 495 
е ве 4 — ү 
Е (ә) Е 3 (1,5) = 4.60 
E m mEi + mE, 604,95) + 34.60) _ 4,83 
M + My 9 
ee ee U 
The analysis of variance is also summarized in this table. A test on the 
over-all hypothesis that o? = 0 may be made from the data in the analysis- 
ч table—the latter utilizes only the intrablock treatment informa- 
Effective error terms for comparisons between treatments which are first 
ог second associates are also indicated, When these two errors do not 
differ appreciably, an average error may be computed as 
E; = Ei + mE; 
m+ my 
where n, and n, are, respectively, the number of first and second associates 
parisons between 


for any treatment. Com two treatments which are first 
associates have the following form: 


г 470 
Eir C 
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Comparisons between two treatments which are second associates have the 


following form: in = ту 
2Ejr 
It should be noted that, when д, < As, E; will be than E. 

When the interblock variance is large relative to the intrablock variance 
(in this case the interblock variance is quite large relative to E,), the inter- 
block treatment information may generally be Lr ate Computa- 
tional details for the recovery of interblock treatment information are given 
in Cochran and Cox (1957, pp. 460-463). 


9.11 Linked Paired-comparison Designs 

Designs having block size equal to 2 are of particular interest in - 
comparison work, Bose (1956) has described a al class of Бары 
which he calls linked paired-comparison designs. ond by Bo 
to balanced incomplete-block designs. The notation used by Bose is as 
follows: 
n = number of objects to be compared, 
v = number of judges to be used. 
r = number of pairs of objects to be compared by each judge, 
b = n(n — 1)/2 = number of pairs. 
The following conditions are required. 

1. For the r pairs of objects compared by cach of the judges, each object 
must appear a times, judged by k) bt 

2. Each pair of objects is judges, where k > 1, 

3. For ж кеі ges, there are exactly A pairs of objects which are com- 
mon to the ғ pairs assigned қам 

The following design із an a linked paired-comparison design 
in which л = 4,0 = 3, b = 6,6 = 4, k = 2,2 = 2, and a = 2: 


Judge Pairs assigned to judge 
a (140, (1.9, 2A), 2) 
b (1,3) (2,4), (1,2), OAD 
e (1,4), (1.2), (2,2), OAD 


Among the r = 4 rs of objects assigned to a judge, cach appears 
а= гү: же An re je ad = 2pairsincommon. For example, 
judge a and judge В have the pairs (1,3) and (2,4) in common. 
There is a between linked block 


and balanced 
incomplete-block designs. Bose indicates a method for obtaining the 
former from the latter. In this correspondence, has the role of a 


treatment, and a set of of treatments has the of a block. To 
illustrate ths correspond nec, suppose that is an even number and that the 
symbols with asterisks represent parameters of a balanced incomplete- 
block design; then the parameters of a linked paired-comparison design are 
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related to those of a balanced incomplete-block design (marked with an 
asterisk) as follows: 
* * 
v=v*, pa ; k=k, 2 nh я a= r*. 
2 2 
(In this context v* represents the number of treatments.) Thus, if 
п = 8, 0= 7, г = 12, к= 3, д = 4, and х = 3, 
for a linked paired-comparison design, then 
v* = 7, =} k* —3, At = 1 


for the corresponding balanced incomplete-block design. In the latter 
design, 


v*r* — 7(3) 
еке ала e UR vr = 
i47 k* 3 f: 
To illustrate the method of obtaining the linked paired-comparison 
design from the balanced incomplete-block design described in the last 
paragraph, suppose that the latter design is represented as follows: 


Blocks Treatments 
I Отогот 
i b c e 
Ш «жй 
IV de г 
У See fi a 
VI mi b 
VIL g a с 


If n = 8, there are 8(7)/2 = 28 pairs of objects. These pairs may be 
arranged in seven sets of four pairs each in a way that no object appears 
twice within апу set. The sets are as follows (note the cyclic arrangement of 
the columns): 


Set Pairs 

I (4,8), (1,7), (2,6), (3,5) 
П (5,8), (2,1), (3,7), (4,6) 
ІШ (6,8), (3,2), (4,1), (5,7) 
IV (7,8), (4,3), (5,2), (6,1) 
у, (1,8), (5,4), (6,3), (7,2) 
VI (2,8), (6,5), (7,4), (1,3) 
Vil (3,8), (7,6), (1,5), (2,4) 


Suppose that the treatments in the balanced incomplete-block designs are 
considered to be judges; suppose that the judges are assigned the sets of pairs 
corresponding to the block in which the corresponding treatment is located. 
That is, since treatment а appears in blocks I, V, and VII, the corresponding 
sets of pairs are assigned to judge a. Similarly, since treatment b appears in 
blocks I, II, and VI, the corresponding sets of pairs are assigned to judge b. 
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In this way each judge is assigned three sets of four pairs each. Hence each 
judge makes r = 12 comparisons. Ina complete block design, each judge 
would make 28 comparisons. 

Bose has not suggested special methods for analyzing data obtained from 
a linked block paired-comparison design. There are several general 
methods for analyzing paired-comparison data. A review of these methods 
as well as an extensive bibliography will be found in Jackson and Flecken- 
stein (1957). Іп particular, the method suggested by Scheffé could be 
adapted for handling the analysis of data obtained from linked block paired- 
comparison designs. These designs should prove to be particularly useful 
in psychometric scaling. 


CHAPTER 10 


Latin Squares and Related Designs 


10.1 Definition of Latin Square 


A Latin square, as used in experimental design, is a balanced two-way 
classification scheme. Consider the following 3 x 3 arrangement: 


a b c 
b c a 
(4 а b 


In this arrangement, each letter occurs just once in each row and just once 


in each column. The following arrangement also exhibits this kind of 
balance: 


(е а b 


The latter arrangement was obtained from the first by interchanging the 
first and second rows, 

Use of this type of balance may be incorporated into a variety of designs. 
For example, consider an experiment involving the administration of three 
treatments to each of three subjects. Тһе order in which subjects receive 
the treatments may be either completely randomized or randomized under 
the restriction of balance required for the Latin square. If the treatments 
are designated 4, dg, and ay, a balanced arrangement with respect to order of 
administration is obtained by employing the following plan: 


Order 1 Order 2 Order 3 
Subject 1 a а аҙ 
Subject 2 аз а а 
Subject 3 аз аҙ a 


If this plan is followed, each treatment is administered first once, second 

once, and third once. If there is a systematic additive effect associated with 

the order of administration, this effect can be evaluated. Had order of 
514 
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administration been randomized independently for each subject, this 
balance would not in general be achieved and evaluation of the order effect 
could not be made readily. (In the design outlined above, instead of indi- 
viduals, groups of subjects may be assigned to the rows of the Latin square.) 

Two Latin squares are orthogonal if, when they are combined, the same 
pair of symbols occurs no more than once іп the composite square. For 
example, consider the following 3 x 3 Latin squares: 


(1) (2) (3) 
а а аз ba bs by сі E сз 
ay а ay b by b ‚ИМЕ с 
4 а а by by bg са са e 


Combining squares (1) and (2) yields the composite square 
аһ а gh 
а,б asb aby 
абу абз афа 
In this composite the treatment combination a,b, occurs more than once; 
hence squares (1) and (2) are not orthogonal. Н 
Combining squares (1) and (3) yields the following composite: 
асу ас; а» 
Gy ас, 4% 
ас, ас; ас 
In this composite no treatment combination is repeated. There are nine 
possible treatment combinations that may be formed from three levels of 
factor A and three levels of factor C. Each of these possibilities appears in 
the composite. Hence squares (1) and (3) are orthogonal. И MA 
Extensive tables of sets of orthogonal Latin squares are given in Fisher 
and Yates (1953) and Cochran and Cox (1957). The composite square 
obtained by combining two orthogonal Latin squares is called a Greco- 
Latin square. The 3 x 3 Greco-Latin square obtained by combining 
squares (1) and (3) above may be represented schematically as follows: 


11 2 233 
23 31 12 
32:11 13. 121 


This representation uses only the subscripts that appear with the a's and 
с^. Interchanging any two rows of a Greco-Latin square will still yield a 
Greco-Latin square. For example, interchanging the first and third rows of 
the above square yields the following Greco-Latin square: 


32 43 2 
23: 31 12 
п 22 33 


Any two columns of a Greco-Latin square may also be interchanged without 
affecting the required balance. 
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Itis not always possible to find a Latin square orthogonal to a given Latin 
Square. For example, no orthogonal squares exist for 6 х 6 squares, 
Also, no orthogonal squares exist for 10 x 10 squares. If the dimension 
ofa Latin square is capable of being expressed in the form (prime number)", 
where л is any integer, then orthogonal squares exist. For example, or- 
thogonal Latin squares exist for squares of the following dimensions: 


dL 4-2 ы аты а= 22, Dai 


However, neither 6 nor 10 may be expressed in this form. In addition, 
there are cases in which orthogonal squares exist when the dimension of the 
Square is not of the form (prime number)"—particularly when the dimension 
isdivisibleby4. For example, orthogonal squares may be constructed for 
the 12 x 12 square. 

Forap x psquare which has orthogonal squares, there are at mostp — 1 
Squares in an orthogonal set. The latter is a collection of p x p squares 
which are mutually orthogonal to each other; i.e., each square is orthog- 
onal to all others in the set. Three or more orthogonal squares may be 
combined to yield hyper-Latin squares. Such squares аге readily obtained 
from tables of complete sets of orthogonal squares. 


10.2 Enumeration of Latin Squares 


The standard form ofa Latin Square is, by definition, that square obtained 
by rearranging the rows and columns until the letters in the first row and 
the letters in the first column are in alphabetical order. For example, ће 
square 


b a è 
a) Con a 

a с b 
has the standard form 

a b Ге 
(2) Ж КАЛЫ 

с а b 


The standard form (2) is obtained from (1) by interchanging columns | and 
2. , All3 x 3 Latin Squares may be reduced to this standard form. From 
this standard form (3!) - 2!) —1=(3-2-1)2-1) —1 =11 different 
nonstandard 3 x 3 Latin squares may be constructed. Hence, including 
the standard form, there are 12 different 3 x 3 Latin squares. 


For 4 x 4 Latin squares there are 4 different standard forms. One of 
these 4 is given in (3). 


(3) 


anoa 
оазе 
8 46 
Saana 
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A second standard form is given by (4). 
b с 
e ч 


Rees 
OSAA 


[4 
d 
a b 
From each of these standard forms (4!)(3!) — 1 = 143 different nonstand- 
ard 4 x 4 Latin squares may be constructed by the process of interchanging 
rows апа columns. Nonstandard squares constructed from standard form 
(3) will be different from nonstandard squares constructed from standard 
form (4). Thus (3) potentially represents 144 Latin squares (143 non- 
standard squares plus | standard square). Standard form (4) also repre- 
sents potentially 144 different 4 x 4 Latin squares. Since there are 4 
different standard forms, the potential number of different 4 x 4 Latin 
squares is 4(144) = 576. 

The number of possible standard forms of a Latin square increases quite 
rapidly as the dimension of the square increases. For example, a 6 x 6 
Latin square has 9408 standard forms. Each of these forms represents 
potentially (6!)(5!) = 86,400 different squares. The total number of 
different 6 x 6 Latin squares that may be constructed is 


(9408)(86,400) — 812,851,200. 


In most tables, only standard forms of Latin squares are given. To 
obtain a Latin square for use in an experimental design, one of the standard 
squares of suitable dimension should be selected at random. Тһегоу апа 
columns of the selected square are then randomized independently. Тһе 
levels of the factorial effects are then assigned at random to the rows, 
columns, and Latin letters of the square, respectively. To illustrate the 
procedure for randomizing the rows and columns ofa4 x 4 Latin square, 
suppose that square (3) is obtained from a table of Latin squares. Two 
random sequences of digits 1 through 4 are then obtained from tables of 
random sequences. Suppose that the sequences obtained are (2,4,1,3) and 
(3,4,1,2). The columns of square (3) are now rearranged in accordance 
with the first sequence. That is, column 2 is moved into the first position, 
column 4 into the second position, column | into the third position, and 
column 3 into the fourth position. The resulting square is 


b ae с 
а c nod 
d a c b 
c b ағ а 


The rows of the resulting squares аге now rearranged in accordance with the 
second random sequence. Thatis, row 315 moved to the first position, row 
4 moved to the second position, row 1 moved to the third position, and row 
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2 moved to the fourth position. After these moves have been made, the 
resulting square is 


осо 
еше 
FARO 
once 


This last square represents a random rearrangement of the rows and 
columns of the original 4 x 4 standard form. This last square may be 
considered as a random choice from among the 144 squares represented. 
by the standard form (3). Since (3) was chosen at random from the stand- 
ard forms, the square that has just been constructed тау be considered а 
random choice from the 576 possible different 4 x 4 Latin squares. 

Some additional definitions of types of Latin squares will be helpful in 
using tables. The conjugate of a Latin Square is obtained by interchanging 
the rows and columns. For example, the following squares are conjugates: 


a b с а с b 
c a b b a c 
b c a c b a 


The square on the right is obtained from the square on the left by writing 
the columns as rows. That is, the first column of the square on the left 
is the first row of the square on the right, the second column of the square 
on the left is the second row of the Square on the right, etc. Conversely, 
the square on the left may be obtained from the square on the right by 
writing the rows of the latter as columns. 

A one-step cyclic permutation of a sequence of letters is one which 
moves the first letter in the sequence to the extreme right, simultaneously 
moving all other letters in one Position to the left. For example, given 
the sequence abed, a one-step cyclic permutation yields beda; a second 
cyclic permutation yields cdab; a third cyclic permutation yields dabc; 
and a fourth cyclic permutation yields abcd—the latter is the starting 
Sequence. Given a sequence of p letters, a p x p Latin square may be 
constructed by p — 1 one-step cyclic permutations of these letters. In 


the case of the sequence abed, the Latin Square formed by cyclic permuta- 
tions is 


anona 
aano 
DAAN 

ecan 


A balanced set of Latin Squares is a collection in which each letter 
appears in each possible position once and only once. For a p x p 
square, there will be p squares in a balanced set. Givenany p x p square, 
a balanced set may be constructed by cyclic permutations of the columns. 
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For example, the squares given below form a balanced set: 


I I ш 
а b с b с а c a b 
е а b a b c b c a 
b c a c a b a b c 


Square II is obtained from square I by moving the first column of square I 
to the extreme right. Square III is obtained from square II by moving the 
first column of square H to the extreme right. The resulting set is balanced 
in the sense that each of the nine positions within the squares contains any 
given letter once and only once. 

One square may be obtained from another by substitution of one letter 
for another. For example, given the following square: 


a b c d 
Ба аттес 
с жад а 
дз СА cac Sb 


Suppose that each of the a's is replaced by a b, each of the P's replaced by a c, 
each of the c’s by a d, and each of the d's replaced byana. This kind of 
replacement procedure is called a one-to-one transformation. The square 
resulting from this transformation is given below: 


Б с сак. 
ts Фуйү @ 
As ited ali? 
aes "bl x 
This last square has the standard form given below: 
qp Ate id 
io idl З. 
Ch fait die 15 
ақа OE: 


Since this latter square has a different standard form from that of the 
original square, the one-to-one transformation yields a square which is 
different from any square which can be constructed by permutation of the 
rows or columns of the original square. 


10.3 Structural Relation between Latin Squares and Three-factor 
Factorial Experiments 

There is an interesting structural relation between ap x p X p factorial 
experiment and a p X p Latinsquare. This relation will be illustrated by 
means of a 3 x 3 x 3 factorial experiment. It will be shown that the 
latter may be partitioned into a balanced set of 3 x 3 Latin squares. In 
the following representation of a 3 х 3 x 3 factorial experiment, those 
treatment combinations marked X form one Latin square, those marked 
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Ү form a second, and those marked Z form a third. The set of three 
Squares formed in this manner constitutes a balanced set: 


сұ | Cy сз 


b, ba 8 by by 8 b, b, bs 


а асан d рд: cde 
ap Wu cx b Yn zu vox ox 
@ GC ext ym. rz 


The treatment combinations marked Y may be grouped as follows: 


XXN 


X 
2 
Y 


abcy abcis» abcey33 
абс абсэзз abcess 
абсззу абсзуә абсзэз 


This set of treatment combinations may also be represented in the follow- 
ing schematic form: 


(х) 
by b, bs 
а сі C2 сз 
а єз сі E 
а | с єз сі 


In this latter arrangement the c’s form a Latin square. 
The treatment combinations marked Y and Z in the factorial experiment 
may be represented by the following Latin squares: 


(Y) (Z) 
by by 8 bi ba bs 
a | % сі C2 а) с єз Сї 
%| с C3 сі a} а Co ul 
a | сү Co C3 4$ | с сі Са 


In a complete factorial experiment there is balance in the sense that treat- 
ment a; occurs in combination with each b; and each c, as well as in com- 
bination with all possible pairs bey. Ina Latin square there is only partial 
balance; each a, occurs in combination with each b; and each с,, but each 
а, does not occur in combination with all possible pairs Әс. For example, 
їп square (Y) only the following combinations of bc occur with level a: 


beis bey beso 
The following combinations of be do not occur with level a,: 
Бсү Әс bes DC bes, beg 


A Latin square is balanced with Tespect to main effects but only partially 
balanced with respect to two-factor interactions. 
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It is of interest to note the parameters estimated by the sum of all observa- 
tions at level a, in Latin square (У). Assume for the moment that the cell 
means have the following form: 

АВС» = и + оу + By + ya + ens + Ви + аула + Bris 
АВС = и + % + Bo + Уз + ёш + а + уп + Вуз 
| 
| 
| 
I 


АВС,» u + жу + Bs + Ya + Ene + оз + ®у + Вуз» 
А =и+а +0 +0 +4, FO + Bye, 
Assuming further that factors A, B, and C are fixed, it follows that 
X; = 9. Уу = 0, Хав, = 0, «у = 0. 
3 к 3 k 


However, Brig + Ву + Вуз 7 0. 


Thus, in addition to sources of variation due to « and ғ, A includes a source 
of variation due to By. Unless оз, = 0, the main effect due to factor 4 will 
be confounded with the BC interaction. 

In general, when a Latin square is considered as a fractional replication 
of a three-factor experiment, unless ой, = 0, the main effect due to factor A 
will be confounded with the BC interaction. Similarly, it may be shown 
that unless o2, = 0, the main effect due to factor B will be confounded with 
the AC interaction; unless ož; = 0, the main effect due to factor C will be 
confounded with the ABinteraction. The variance due to the three-factor 
interaction, o25,, is assumed to be zero throughout. 


10.4 Uses of Latin Squares 

In agricultural experimentation, the cells of a p p Latin square define 
p? experimental units. In this case, the selection of a Latin square defines 
the manner in which the treatments are assigned to the experimental units. 
That is, the procedure followed in selecting the Latin square specifies the 
randomization procedure necessary for the validity of statistical tests made 
insubsequentanalyses. Inits classic agricultural setting, the Latin-square 
design represents a single-factor experiment with restricted randomization 
with respect to row and column effects associated with the experimental 
units, In this design it is assumed that treatment effects do not interact 
with the row and column effects. Fisher (1951) and Wilk and Kemp- 
thorne (1957) carefully distinguish between the use of a Latin square to 
provide the design for the randomization procedure and the use of a Latin 
square for other purposes. , $ 

In biological experimentation, the Latin square provides a method 
for controlling individual differences among experimental units. For 
example, suppose that it is desired to control differences among litters 
as well as differences among sizes within litter for animals assigned to 
given treatment conditions. If there are four treatments, and if the size 
of the litter is four, then a 4 x 4 Latin square may be used to obtain the 


© 
П 
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required balance. Use of this plan provides the experimenter with a dual 
balance. 


Size within litter 
Litter |—— — —— — — 


1 ty ti ty 1 
2 tg ГА t % 
3 ti % ГА ц 
4 fa ГА ty fi 


That is, the experimental units under any treatment are balanced with 
respect to both litter and size within litter. This plan may be replicated 
as many times as is required. An alternative plan, not having this dual 
balance, would assign the experimental units at random to the treatment 
conditions. This plan is to be preferred to the balanced plan when the 
variables under control by restricted randomization do not in fact reduce 
the experimental error sufficiently to offset the loss in the degrees of free- 
dom for estimating the experimental error. 

In the behavioral sciences, the more usual application of the Latin square 
is in selecting a balanced fractional replication from a complete factorial 
experiment. To illustrate this procedure, consider an experiment con- 
ducted in accordance with the following plan (in this plan c, cs, and су 
represent three categories of patients): 


T 
Drug 1 Drug 2 Drug 3 


Hospital 1 сз сі са 
Hospital 2 сі C3 са 
Hospital 3 C3 Cy сі 


Suppose that 15 patients are sampled from each of the hospitals, 5 patients 
from each of the three categories of patients. According to this design: 


Patients from hospital 1 in category 2 are given drug 1. 
Patients from hospital 1 in category | are given drug 2. 
Patients from hospital 1 in category 3 are given drug 3. 


Upon completion of the experiment, drug | is administered to some 
patients from each of the categories as well as some patients from each of 
the hospitals. This same kind of balance also holds for the other drugs. 

In order to have five Observations in each cell, the complete factorial 
would require 45 patients from each hospital, 15 patients in each category 
from each of the hospitals, that is, 135 patients in all. The Latin-square 
design requires a total of 45 patients. In this context, a Latin square 
represents one-third of the complete factorial experiment. Under some 
conditions, however, the Latin-square design may provide the experimenter 
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with all the important information. If interactions with the hospital factor 
may be considered negligible relative to the hospital main effects and the 
drug by category interaction, then this design provides the experimenter 
with complete information on the main effects of drugs and categories as 
well as partial information on the main effects of hospitals and drug by 
category interaction. Prior experimentation in an area and familiarity with 
the subject matter are the best guides as to whether assumptions about 
negligible interactions are warranted. Pilot studies are also a valuable 
source of information about underlying assumptions. 

Another important use of the Latin square in the area of the behavioral 
sciences is to counterbalance order effects in plans calling for repeated 
measures, For example, consider the following experimental plan: 


Order 1 Order 2 | Order 3 | Order 4 


Group 1 аз а а а, 
Group 2 аз аз а а 
Group 3 а аз а аз 
Group 4 а а ds а 


Suppose that each of the groups represents а random subsample from a 
larger random sample. Each of the subjects within group 1 is given treat- 
ments a, through a, in the order indicated by the columns of the Latin 
square. Upon completion of the experiment, each of the levels of factor 
A will be administered once in each of the orders. Hence there is a kind 
of balance with respect to the order effect. If an order effect exists, it is 
“controlled” in the weak sense that each treatment appears equally often 
in each of the orders. Implicit in this type of “control” are the assump- 
tions of a strictly additive model with respect to the order factor. That 
is, the order factor is assumed to be additive with respect to the treatment 
and group factors; also, order is assumed not to interact with the latter 
factors. (Further, the homogeneity-of-covariance assumptions under- 
lying a repeated-measure design are still required for the validity of the 
final tests.) 1 

In another sense, however, order effects are not undercontrol. Consider 
the administration of a, under order 3. For the plan given above, а; is 
preceded by a; and a; in that order. Should the sequence in which а, 
and ау precede a, have any appreciable effect, this plan does not provide 
an adequate control on such sequence (carry-over) effects. (The term 
sequence effect will be used to indicate differential effects associated with 
the order in which treatments precede other treatments.) Differences 
among the groups in this plan essentially represent differences among the 
four sequences called for by this plan. 4 

A modification of the plan that has just been considered leads to another 
important use of the Latin square. Suppose that each of the groups is 
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assigned to a different level of factor B. If the Sequence effects are neg- 
ligible relative to the effects of factor B, differences among the groups 
measure the effects due to factor B. If interactions with order effects are 
negligible, then partial information with respect to the AB interaction 
may be obtained. Assuming n subjects in each of the groups, the analysis 
of this plan takes the following form: 


————  —— 2-22 


Source of variation df 

Between subjects 4n —1 

Groups (В) 3 

Subjects within groups 4(n — 1) 
Within subjects 12n 

Order 3 

A 3 

(АВ) (partial information) 6 

Residual 12n - 12 


In many applications, the Latin square forms a building block within a 
more comprehensive design. Use of a Latin Square provides economy of 
experimental effort only in cases where certain of the interactions (to be 
specified more completely in the following section) are zero. Complete 
factorial experiments result in unnecessary experimental effort in areas 
where the assumptions underlying the analysis of Latin squares are appro- 


priate. 
10.5 Analysis of Latin-square Designs—No Repeated Measures 


Plan 1. Consider the following 3 x 3 Latin square as a fractional 
replication of a 3 x 3 x 3 factorial experiment: 


қ bs 8 
4) & e сз 
а | с Cy сі 
а а єз C2 


Assume there are n observations in each cell. Assume further that two- 
factor and three-factor interactions are negligible relative to main effects. 
The model for an observation made in cell ijk may be expressed as 


Хик =u + оа) + Pio + Укы) F тезу + 65,655. 


The subscript (s) is used to indicate that the effect in question is estimated 
from data obtained from a Latin square. The term res includes all 
sources of variation due to treatment effects which are not predictable from 
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the sum of the main effects. Under this model, the analysis and the ex- 
pected values of the mean square are outlined below: 
! 


Source of variation df df for general E(MS) 
case 
A 2 PEA o? + npo? 
B 2 p-1 оў + проў 
С 2 pi 02 + про? 
Residual 2 (p = 1p —2) | 9 + от 
Within cell 9(n — 1) p(n — 1) а? 


-------------------------------------- 


If, indeed, interactions are negligible, the variance due to the residual 
sources should not differ appreciably from the variance due to experimental 
error. According to the expected values of the mean squares, when 
ož, = 0, the expected values for the residual and the within-cell variances 


are both estimates of variance due to experimental error. A partial test 
on the appropriateness of the model is therefore given by the F ratio 


F e M Sres ^ 
MS. сей 


The magnitude of this F ratio indicates the extent to which the observed 
data conform to the model (which postulates no interactions). However, 
a test of this kind is somewhat unsatisfactory. Decisions as to the appro- 
priateness of the model should, in general, be based upon evidence inde- 
pendent of that obtained in the actual experiment. Pilot studies, guided 
by subject-matter knowledge, should, wherever possible, serve as the basis 
for formulating the model under which the analysis is to be made. 

Granting the appropriateness of the model, tests on main effects can 
readily be carried out. If interaction is present, some of the tests on main 
effects will no longer be possible. If, for example, all interactions with 
factor A may be considered negligible but the BC interaction not negligible, 
then the main effect of factor A will no longer be clear of interaction effects. 
However, main effects of B and C will be clear of interaction effects; i.e., 
tests on these latter main effects can be carried out. 

Numerical Example. Computational procedures for the plan to be 
described are similar to those used in a two-factor factorial experiment. 
The data in Table 10.5-1 will be used for illustrative purposes. Suppose 
that an experimenter is interested in evaluating the relative effectiveness of 
three drugs (factor B) on three categories (factor C) of patients. Patients 
for the experiment are obtained from three different hospitals (factor A). 

The experimenter obtains 12 patients from each of the hospitals. Of 
these 12 patients, 4 belong to category сі, 4 to category cs, and 4 to category 


сз. The total number of patients in the experiment is 36; the total number 
of patients in any one category is 12. j 
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The design for the experiment is the Latin square given at the left of part 
i. The first row of this Latin square indicates the manner in which the 12 
patients from hospital a, are assigned to the drugs. The four patients in 
category сз are given drug 5,; the four patients in category c, are given 
drug Рз; the four patients in category c, are given drug by. The criterion 


Table 10.5-1 Numerical Example 


Design: Observed data: 
Ирет b, by ba 
@) 
Cell totals 
bi bs by Total 
@ | 33 5 9 47-4, 
5 n=4 
(ii) a 11 14 43 68 = 4, 
p=3 
аз 6 9 6 21 = Аз 
Total| 50 28 58 136=G 
Bist Весы Be 
29 22 85 
(ее а аны СА 
ee А ТЕТО СРИНИН 
(1) = С?/пр* = 513.78 (4) = =B?/np = 554.00 
(2) = ХХ? --978.00 (5) = EC?[np = 712.50 
(3) = EA?[np = 606.17 (6) = S(ABC)*/n = 878.50 


(i) SSe = G3) — (1) = 9239 SShetween cens = (6) — (1) = 364.72 
SS, = (4) — (1) = 4022 55, ces = (2) — (6) = 99.50 
SS, = (5) — (1) = 198.72 
SS = (6) - (3) — (4) — (5) + 2(1) = 33.39 
EE MEN FRIES г 1 


scores for these three sets of four patients are given in the first row of the 
table of observed data. 

In part ii are the cell totals for the four observations within each cell of 
the design. The entry in row ау, column b,, which is 33, is the sum of the 
criterion scores for the four patients in category сз from hospital ay—these 
four patients were given drug Бу. This total will be designated by the sym- 
bol 4 BCas. Similarly, the entry in row а), column bs, which is 43, will be 
designated by the symbol A BC,,,—this total is the sum of the four observa- 
tions under treatment conditions abc,5.__ Row totals and column totals in 
part ii define, respectively, the A,’s and the Bjs. The C,’s are also obtained 
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from the cell totals. From the design, one locates cells at specified levels 
of factor C; thus, 

С,= 9--14--6--29, 

6,2 5 $11 +6 = 22, 

C, = 33 + 43 4-9 = 85. 

Computational symbols are defined in partiii. In terms of the symbols, 

the variation due to the main effect of factor A is estimated by 
SS, = np&(4, — G} = (3) — (1) = 92.39. 

The variation due to the main effect of factor B is estimated by 
SS, = npX(B, — Gy = (4) — (1) = 40.22. 

Similarly, the variation due to the main effect of factor C is estimated by 
SS, = npX(C, — 6) = (5) — (1) = 198.72. 

The residual sum of squares is computed in a somewhat indirect manner. 

The variation between the nine cells in the Latin square is 
SSnetween сеци = NZ(ABC; уь — б)" = (6) — (1) = 36472. 
This source of variation represents a composite of all factorial effects, main 
effects as well as interactions. There are eight p* — 1 degrees of freedom 
in this source of variation. The residual variation is given by 
SSres = SSpbetween cells — SS, тт SS, pt. SS, 
= (2) — 3) — (4) — (5) + 2(1) = 33.39. 

From this point of view, the residual is that part of the between-cell 
variation which cannot be accounted for by additivity of the three main 
effects. If no interactions exist, this source of variation provides an esti- 
mate of experimental error. If interactions do exist, the residual variation 
is, in part, an estimate of these interactions. 

The within-cell variation is estimated by 

$$. веш = Catenin ABC! = (2) - (6) = 99.50. 


m 


A summary of the analysis of variance is given in Table 10.52. — 
A partial test of the hypothesis that all interactions are negligible is given 


by 


= MSrese ШОШ роз 

MSy.cen 3.69 
Fora .05-level test, thecriticalvalueisF,,2,27) = 3.35. Hence the experi- 
mental data tend to contradict the hypothesis that the interactions are 
negligible. Under these conditions, the adequacy of the Latin-square 
design is questionable—estimates of the main effects may be confounded by 
interaction terms. 


F 
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If all interactions with the hospital factor are negligible, then the varia- 
tion due to the residual represents partial information on the drug by 
category interaction. Under this latter assumption, the main effects of 
the drug factor and category factor will not be confounded with inter- 
action effects. If the assumption is made that all interactions with the 
hospital factor are negligible, the experimental data indicate that the drug 
by category interaction is statistically significant. Tests on main effects 
for drugs and categories also indicate statistically significant variation. 


Table 10.5-2 Analysis of Variance for Numerical Example 


Source of variation SS df MS | F 
Hospitals (A) 92.39 2 46.20 12.52* 
Drugs (B) 40.22 2 20.11 5.45* 
Categories (C) 198.72 2 99.36 26.93* 
Residual 33.39 2 16.70 4,53% 
Within cell 99,50 27 3.69 

Total 464.22 35 | 


* F (2,27) = 3.35 


The test on the main effects due to hospitals cannot adequately be made 
since this source of variation is partially confounded with the drug by 
category interaction. If the hospital factor is a random factor, there will 
ordinarily be little intrinsic interest in this test. 

The experimental data do indicate statistically significant differences 
between the relative effectiveness of the drugs; however, in view of the 
presence of drug by category interaction, the interpretation of these dif- 
ferences requires an analysis of the simple effects. Information for this 
latter type of analysis is not readily obtained from a Latin-square design. 
Had the existence of an interaction effect been anticipated, a complete 
factorial experiment with respect to the interacting factors would have been 
the more adequate design. 

To show the relationship between the analysis of variance for a Latin 
square and the analysis for a two-factor factorial experiment, suppose factor 


A in Table 10.5-1 is disregarded. Then the analysis of variance is as 
follows: 


ee ee LÀ 


Source 55 df 
E (9-()- 402 | 2 
с (5) — (1) = 198.72 2 
B X(BC)? 

C a E O ORO 125.78 4 
Within cell (2) — (6) = 99.50 | 27 


— | ә 
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The two degrees of freedom for the main effect of factor A are part of the 
BC interaction of the two-factor factorial experiment. Thus, in a Latin 
square the interaction term is partitioned as follows: 


BC ss df 
A 92,39 | 2 
Residual |S$,, — 55, = 3339 | 2 _ 

125.78 | 4 


Plan 2. Use of a Latin square as part of a larger design is illustrated 
in the following plan: 


b by bs bi by by 

| с сз с ау | сү сз с 

di | @ | сү сз Ca d | a2 | с сі Cg 
аз | с сі в аз | с Cy с, 


Assume that there are л independent observations within each cell. АП 
observations in a single square are made at the same level of factor D. The 
separate squares are at different levels of factor D. In some plans it is 
desirable to choose a balanced set of squares, in others use of the same 
square throughout is to be preferred, while in still other plans independ- 
ent randomization of the squares is to be preferred. Guides in choosing 
the squares will be discussed after the analysis has been outlined. | 

A model for which this plan provides adequate data is the following: 


E(X ijmo) = H + 4) + Ву) + Ума) + Om + bitom + Бдхдт + удмә F resq). 


The subscript (s) indicates that an effect forms one of the dimensions of the 
square. It will be noted that no interactions are assumed to exist between 
factors that form part of the same Latin square (i.e., factors A, В, and С). 
However, interactions between the factor assigned to the whole square 
(factor D) and the factors that form the parts are included in the model. 
Assuming that all factors are fixed and that there are n observations in 
each cell, the appropriate analysis and expected values for the mean squares 
(as obtained from the above model) are outlined in Table 10.5-3. А partial 
check on the appropriateness of the model is provided by the F ratio 
з Mere 
MS. сеп 


When the model is appropriate, this ratio should not differ appreciably 


from 1.00. Wr 

Use of this experimental plan requires a highly restrictive set of assump- 
tions with respect to some of the interactions. Should these assumptions 
be violated, main effects will be partially confounded with interaction effects. 
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If, for example, it may be assumed that interactions with factor A are 
negligible, but that the BC interaction cannot be considered negligible, then 
the main effect of factor A will be confounded with the BC interaction. 
However, main effects of factors B and C will not be confounded. When 
interactions with factor A are negligible, partial information on the BC 
interaction is available from the within-square residuals. When such 
information is to be used to make inferences about the BC interaction, itis 
generally advisable (if the design permits) to work with a balanced set of 
Latin squares rather than to work with independently randomized squares. 


Table 10.5-3 Analysis of Plan 2 


Source of variation df df fongenergi E(MS) 

A 2 р-іІ оў + npqoi 
B 2 р-1 оў + прдоў 
© 2 p-1 оў} + прдоў 
р 1 4—1 o? + пр?) 
AD 2 (p—1(q—1) | o + npozs 
BD 2 (р —1)4—1) a? 4 проф, 
ср 2 (p —1)4—1) оў + про) 
Residual 4 gp = 0р – 2) | 02 + пс e 
Within cell 18(n — 1) Р4т—1) | e 


#"_———_.—_—_._—._..—= 


Use of a balanced set of squares will provide partial information on all the 
components of the interaction term. Use of the same square throughout is 
to be avoided in this context, since information on only a limited number of 
the interaction components will be available. 

Computational Procedures for Plan 2. Тһе computational procedures 
assume that there are q levels of factor D and n observations in each cell of 
Шер x p Latin squares included in the plan. The observed data will con- 
sist of g squares, one for each level of factor D; each square will contain np” 
observations. Hence the total number of observations in the experiment 
isnp*q. From the observed data, a summary table of the following form 
may be prepared (for illustrative purposes, assume that p = 3 and q = 2): 


| d, d, 
al Ар” ҰТА. А 
az AD», Ару 4» 
аз | Абу Абу; Аз 
D; Ds G 


Each of the 4.D,,,'s in this summary table is the sum of the np observations 
in the experiment which were made under treatment combination adim: 
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From this summary table, one may compute the following sums of squares: 


2 2 
SS А = 
пра пр? 
2 2 
| скаку 
np — npq 
ЖАБЫ) ААТ peas GÀ 
55, | ы 
пр пра пр? пр 


In a manner analogous to that by which the AD summary table is соп- 
structed, one may construct a BD summary table. From this latter table 
one computes SS, and SS,,. From a CD summary table one can compute 
SS, and SS,,. 

To obtain the variation due to the residual sources, one proceeds as 
follows: The Latin square assigned to level d,, will be called square m. 
The residual variation within square m is given by 


У (cell totals)? d р, 2] ХС ы j 262, 
np 


| SS.) = €? ar 
| п пр пр пр 


The notation > indicates that the summation is restricted to square т. 
The degrees of freedom for residual variation within square m are 
(p — (p - 2). 
The residual variation for the whole experiment is 
55. = XSSresw). 
The degrees of freedom for SS,es аге 
dfres = Х(р — 1)(p — 2) = 4(р — Up — 2). 


As a partial check on the assumptions underlying the analysis, the within- 
square residuals for each of the squares should be homogeneous. One 


may use an Ку test for this purpose. 


max 


The within-cell variation is given by 


X(cell total)? 
n 


where the summation is over the entire experiment. To check the assump- 
tion of homogeneity of error variation, the summation in the last expression 
may be restricted to the individual cells. Тһе qp* sums of squares obtained 
in this manner may then be checked for homogeneity by means of an Ку 
test. 


SSw. cen = ZX* — 
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As an additional check on the assumption of no interactions between the 
factors which form the dimensions of the Squares, one has the F ratio 


® "MSiss 


MSw. cell | 
When oz, = 0, this F ratio should be approximately unity. The residual 


variation is sometimes pooled with the within-cell variation to provide a 
pooled estimate of the experimental error, i.e., 


SSpootea error = SSres + SS, сеп. 


The degrees of freedom for SSyooted error 18 the sum of the degrees of free- 
dom for the parts. 

Plan 3. This plan resembles Plan 2 in form, but the assumptions 
underlying it are quite different. Here the treatment combinations in a 
р Хр X p factorial experiment are divided into a balanced set of p x p 
Latinsquares. The levels of factors A, B, and C are then assigned at ran- 
dom to the symbols defining the Latin square. Then the levels of factor 
D are assigned at random to the whole Squares. Hence the number of 
levels for each factor must bep. Anillustration in terms of a balanced set 
of 3 x 3 Latin squares is given below: 


bı ba b bi by by | by by bs 

бі | 64i Сс) с; ау | с, о с | Q | Cy C. Cg 

dı | ag | c cs c dy | as | cy сү с dy| ag} € € е 
% | 69 а с аз | € Са сз аз | с, Cs Сү 


In this plan all the 27 possible combinations of the factors A, B, and Care 
present. However, only one-third of the 81 possible combinations of the 
factors A, B, C, and D that would be present in the complete four-factor 
factorial experiment are present in the above plan. If interactions with 
factor D are negligible, then this plan will provide complete information 
with respect to the main effects of factors A, B, and C, as well as all two-fac- 
tor interactions between these factors. The main effect of factor D will be 
partially confounded with the 4 BC interaction; however, partial informa- 
tion with respect to the 4BC interaction may be obtained. 

This design is particularly useful when the experimenter is interested 
primarily іп а three-factor factorial experiment in which control with respect 
to the main effects of a fourth factor is deemed desirable. The model 
appropriate for this plan is the following: 


E(X istmo) = A T = 4- B; 4- Vet ef; + «у + Ву, + Om + «Ву 


The prime symbol on the three-factor interaction indicates only partial 
information. (This plan can perhaps be more accurately classified as а 
balanced incomplete-block design. Note that no interactions with factor 
D are included in the model; should such interactions exist, estimates of the 
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sources of variation due to factors A, B, C, and their two-factor interactions 
will be confounded with interactions with factor D. 

Assuming (1) that the model is appropriate for the experimental data, 
(2) that there are л observations in each of the p? cells in the experiment, and 
(3) that A, B, and C are fixed factors, an outline of the analysis and the 
expected values of the mean squares are given in Table 10.5-4. 


Table 10.5-4 Analysis of Plan 3 


Source of variation df df $2 se E(MS) 

A 2 p-l o? + пр?о? 
B 2 p i оў + пр?о? 
C 2 TI оў + np*o? 
AB 4 (p = 1)? оё + прозр 
АС 4 (p = 1)* oè + npo, 
BC 4 (p — 1) ог + проф, 
D 2 (р = 1) o? + пр?) 
(ABC) 6 (р —18 —(p—-0]| o} + no, 
Within cell 21(n — 1) pn = 1) в 


Computational Procedures for Plan 3. Because this plan includes a 
balanced set of Latin squares, one may construct an ABC summary table of 


the following form: 
by by bs 


сі Co єз e Cy єз сі 


a, | АВС, АВС ИВС, 
а» 
аз 

Each total of the form АВС, is based upon п observations. Ву means of 

the usual computational formulas for a three-factor factorial experiment 

having n observations per cell, one may obtain the sums of squares for all 


the factorial effects, including SSe- For example, 


The sum of squares due to the three-factor interaction, 55,,, includes the 
variation due to the main effect of factor D. The adjusted sum of squares 


for this three-factor interaction is given by 
n c) 
прі np/- 


55е = SSave — ( 
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55, includes that part of the ABC interaction which is not confounded 
with the main effect due to factor D. The degrees of freedom for SSj», are 
dfas;.. v3 «ТЕРЕ = dfss, 

-(р-1)-(р- 1). 
The within-cell variation is given by 
SS. УХ X(cell total)? ! 
п 


where the notation (cell total) represents the sum of the т observations 
in the experiment made under a unique combination of factors A, B, C, 
and D. The summation is over all cells in the experiment. 

Plan 4. This plan uses the treatment combinations making up a 
factorial set as one or more dimensions of a Latin square. From many 
points of view, Plan 4 may be regarded as a special case of Plan 1. А square 
of dimension p? х р? is required for this plan. It will be illustrated by the 
following 4 x 4 square: 


сақ сақа сау са, 


aby fa h 13 ц 
abis 13 % ц h 
абд ti L^ ty ty 
аб, ty ty h fy 


The treatment combinations along the rows are those forming a 2 x 2 
factorial experiment; the treatment combinations across the columns also 
form a 2 x 2 factorial experiment. The letters in the square may or may 
not form a factorial set. For illustrative purposes suppose that they do 
not. Assume that there are п observations in each cell. If the treatment 
sets which define the dimensions of the Latin square do not interact, then 
an appropriate analysis is given in Table 10.5-5. The expected values of 
the mean squares assume that A, B, C, and D are fixed factors. (Note 
that the factors within a dimension of the Latin square may interact.) 

Computational Procedures for Plan 4. To obtain SS,, SS,, and SS,,, the 
р? row totals are arranged in the form of an AB summary table. 


by b, 
a АВ, АВ! А, 
а АВ, АВ, As 
B, B, G 


Each AB, is the sum of np* observations. Hence each 4, total is based 
upon лр? observations. The variation due to the main effects of factor 


A is on 
MS еы 
55, == e, 
пр пр 
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The variation due to the AB interaction is 


(AB)? _ EA} УВ), б? 
np пр пр m 


The sums of squares $$, SS,, and SS,, are computed in an analogous 
manner from the column totals. 


SS, 


Table 10.5-5 Analysis of Plan 4 


Source of variation df df for general case E(MS) 
Row effects 3 pt 
A 1 pri 02 + onp*o 
B 1 pul оў + пр?ођ 
АВ 1 (р = 1? а? + прозр 
Column effects 3 pt 
© 1 pri оў + про); 
D 1 pd o? + np*os 
cD 1 (р = 1? оё + про) 
Letters in cells 3 p-1 
T 3 p~ o + пр?о? 
Residual 6 (p — 00р? — 2) | 9 + noie. 
Within cell 16(п — 1) pXn — 1) ot 


If the sum of the np? observations at level t, is designated by the symbol 
T,, then the variation due to the treatments assigned to the letters of the 


square is 

.zm c 

ЕТЕРІ 

The variation due to residual sources is 

У(сей total)? ХАВ)? _ E(C Dem)” 
n np? пр? 


2 26? 
ommo 


55, 


SSres = 


The within-cell variation is 
(сей total)? 


SSw. сеп = xx — " 


where the summation is over all cells in the experiment. 
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Summary of Plans in Sec. 10.5 


Plan 1 Plan 2 
bi by by bi ba b, | bi 5 by 
ау | Co сү с; а| с с с а |с с; с 
аз | з со c dı | ag | cy с с dı | aa | co а е 
аз | € с; Cy аз | Co сі сұ аз | Cg с; сү 
Plan 3 
by by by URNAM ER b 5 bs 
аа | € Cy с, 41 | с сз сү а | сз € Cy 
dı | az | с с сі dı | а, | сз с Co d3 | as | су с с 
аз | Cy сі б аз | € Cy с аз | Cy с с 
Plan 4 
сауу cdig са, са,» 
aby, ta h в ц 
abis ty ty ta қ 
абу ti fa fa fs 
abs 1 ty ti ty 


10.6 Analysis of Greco-Latin Squares 


In its classical context, a Latin-square arrangement permits a two-way 
control in variation of the experimental units, i.e., control of row and 
column effects. In a similar context, a Greco-Latin square permits a 
three-way control in the variation of the experimental units, i.e., row 
effects, column effects, and “layer” effects. Thus, a Greco-Latin square 
defines a restricted tandomization procedure whereby p treatments are 
assigned to p? experimental units so as to obtain balance along three 
dimensions, 

From the point of view of Construction, a Greco-Latin square is a com- 
Posite of two orthogonal Latin Squares. (Independent randomization of 


I п Composite 
а b c a В У ax bp cy 
b (6 a y x B by са ap 
Соса у еч ш cB ay bx 


An equivalent representation in terms of numbers rather than letters is 
given below. In the resulting composite Square the first digit in a pair 
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represents the level of one effect, and the second digit the level of a 
second effect. 


I п Composite 
1 2 1 2553 11 22 33 
2 3 1 3 1 2 23 31 12 
3 1 2 3 1 32 13 21 


In a Greco-Latin square, there are in reality four variables—namely, 
row, column, Latin-letter, and Greek-letter variables. From this point 
of view, a Greco-Latin square may be regarded as a kind of four-factor 
experiment. There are p? cells in this square; hence there are p? treatment 
combinations. In а four-factor factorial experiment in which each factor 
has p levels, there are p* treatment combinations. A Greco-Latin square 
may also be regarded as a р?/р* or 1/p* fractional replication of a p x 
p хр хр factorial experiment. 


Table 10.6-1 Analysis of Variance for Greco-Latin Square 


Source of variation df. df jop penera E(MS) 
A (rows) 2 p-1 оў + про? 
B (columns) 2 рі оў + npo; 
C (Latin letters) 2 p оў + про? 
D (Greek letters) 2 ja о? + проф 
Residual - (р — Wp —3)| 92 + поте 
Within cell 9(n — 1) pus) а? 

Total 9n —1 пр? —1 


_  _———————— 


Asa fractional replication of a factorial experiment, main effects of each 
of the factors will be confounded with two-factor and higher-order inter- 
action effects. For example, the main effects of factor A will be con- 
founded with the BC, CD, and BCD interactions. In general, the utility 
of a single Greco-Latin square is limited to experimental situations in 
which the four dimensions of the square have negligible interactions. 
However, Greco-Latin squares may be used to good advantage as part 
of more inclusive designs. The latter are illustrated in Sec. 10.7. If all 
interactions between factors defining the dimensions of a Greco-Latin 
square are negligible, then the analysis of variance takes the form given in 
Table 10.6-1. This analysis assumes that there are л independent obser- 
vations in each of the cells. The specific degrees of freedom are for the 
case of a 3 х 3 square. For this special case, the degrees of freedom of 
the residual variation are zero; only for the case p > 3 will the residual 
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variation be estimable. A partial check on the assumptions made about 
negligible interactions is given by 


Depending upon the outcome of this test, the residual sum of squares is 
sometimes pooled with the within-cell variation to provide an over-all 
estimate of the variation due to experimental error. 


Table 10.6-2 Computational Procedures for Greco-Latin Square 


(1) = Спр? (5) = (2Cp)/np 
(2) = =x? (6) = (ED$)/np 
(i) (3) = (2A})/np (7) = (сей total)?]/n 


(4) = (ХВ?)/пр 
———— —P—— ee «с; - 


A (3) - (1) 

B (4) — (1) 

А С (5) — (1) 

(ii) D (6) — (1) 
Residual (7) — (3) — (4) — (5) — (6) + 3(1) 

Within cell (2) - (7) 

Total Q) — (1) 


———————— 


A summary of the computational procedures is given in Table 10.6-2. 
These procedures differ only slightly from those appropriate for the Latin 
square. The residual variation is obtained from the relation 


SSres = SSpetween сепа — SS, — SS, — SS, — SS,. 


Residual variation includes interaction terms if these are not negligible. 
Otherwise the residual variation provides an estimate of experimental error. 


10.7 Analysis of Latin Squares—Repeated Measures 


In the plans that are discussed in this section, all the restrictions on the 
model underlying a repeated-measure design for a factorial experiment are 
necessary in order that the final F tests be valid. These restrictions Were 
discussed in Chap. 7. Special attention should be given to the possible 
presence of nonadditive sequence effects in experiments which do not 
involve learning. In particular, a repeated-measure design assumes that 
all pairs of observations on the same subjects have a constant correlation. 
If this assumption is violated, resulting tests on within-subject effects tend 
to be biased in the direction of yielding too many significant results. 

The equivalent of the conservative test proposed by Box (1954) and by 
Greenhouse and Geisser (1959) can be adapted for use in connection with 
the designs that follow. For example, if the test for a within-subject main. 
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effect (as indicated by the model requiring the homogeneity-of-correlation 
condition) requires the critical value 
F,_.[(p — 1), p(n — 1)(р — 1)], 
then the corresponding critical value under the conservative test procedure 
is 
F,-4[1, p(n — 1)]. 

In principle, the degrees of freedom for the numerator and denominator of 
the F distribution required in the usual test (as given by the expected values 
of the mean squares obtained from the restricted model) are divided by the 
degrees of freedom for the factor on which there are repeated measures. 

Plan 5. Consider the following 3 x 3 Latin square in which groups of 
n subjects are assigned at random to the rows: 


ay а аз 
б, bs ы by 
Gal by by 
б; by bs ы 


In this plan each of the n subjects іп G, is observed under all treatment com- 
binations in row 1, that is, аб, з, абу, Ры. For example, the levels of factor 
A may represent three kinds of targets, and the levels of factor B may repre- 
sent three distances. There are nine possible treatment combinations in the 
complete 3 х 3 factorial experiment; each of these nine appears in the Latin 
square. However, the individuals within any one group are observed only 
under three of the nine possibilities. Suppose that the order in which a 
subject is observed under each treatment combination is randomized in- 
dependently for each subject. E { 

If the interactions with the group factor are negligible, the following 
model will be appropriate for the analysis (this assumption is reasonable 
if the groups represent random subsamples from a common population): 

E(Xijem) = й + Ôr ть + 9 + В, + ois. 
In this model д, represents effects associated with the groups and тц) 
effects associated with subjects within the groups. The symbol ару indi- 
cates that only partial information is available on this source of variation. 
Assuming that factors 4 and B are fixed factors, the analysis and the 
expected values of the mean squares are given In Table 10.7-1. 

In this analysis, only (p — 1)(p — 2) degrees of freedom for the E 
interaction appear as within-subject effects. The missing p — 1 degrees o 
freedom define the variation among the groups. Since differences among 
the groups, in part, reflect differences due to the effects of various combina- 
tions of 4 and B (which are balanced with respect to main effects), such 
differences define part of the 4B interaction. It is readily shown that 


SS.) = SSgroups + 55%» 


540 


where SS,, is the variation due to the 4B interaction as computed in a two- 
factor factorial experiment. From some points of view, SS,roups May be 
regarded as the between-subject component of the AB interaction, For 


most practical purposes, only SS;, (the within-subject component) is tested; 
tests on the latter component will generally be the more powerful. 
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Table 10.7-1 Analysis of Plan 5 


Source of variation df | df for general case | E(MS) 

Between subjects Зп —1 np -1 

Groups 2 p-1 аё + po? + про? 

Subjects within groups 3(n — 1) p(n — 1) o? + ро? 
Within subjects 6n np(p — 1) 

A 2 p-1 | o2 + про 

B 2 Р—1 ор + праў 

(AB) 2 (p — 1Xp —2) оў + nog 

Error (within) 6n —6 р(п – 1)(р – 1) | о 


! 
oe eee СИНЕ — 

The appropriateness of the model should be given serious consideration 
before this plan is used for an experiment. A possible alternative plan, 
which requires the same amount of experimental effort but does not utilize 
the Latin-square principle, is the following: 


| а а аз 
су b, bi b, 
Gs by by ba 
Gs bs bs 8 


Again the groups are assumed to be random samples of size n from a com- 
mon population. In this plan each of the n subjects in group 1 is observed 
under the treatment combinations Gb, аб, and абы. Assume that the 
order in which subjects are observed under each treatment combination is 
randomized independently for each Subject. In this plan there are repeated 
measures on factor A but no repeated measures on factor B. A model 
appropriate for estimation and tests under this plan is 
E(X isem) = w+ By + ты) F ai + fl; + Tas). 

For this model, assuming that factors 4 and B are fixed, the analysis is 
outlined in Table 10.72. In contrast to Plan 5, under this plan the AB 
interaction is not confounded with between-group differences; however, 
all components of the main effect of factor B are between-subject com- 
ponents. Thatis, differences between levels of factor B are simultaneously 
differences between groups of people. Hence use of the Latin square 
permits a more sensitive test on the main effects of factor B, provided that 


| 
| 
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the model under which the analysis is made is appropriate. In general 
the A x subjects within groups interaction under the alternative plan will 
be of the same order of magnitude as the error (within) term in Plan 5. 

If the experimenter’s primary interest is in factor A and its interaction 
with factor B, and if there is little interest in the main effects of factor B, use 


Table 10.7-2 Analysis of Alternative to Plan 5 


Source of variation df df for general case E(MS) 

Between subjects Зп —1 пр -1 

В 2 pal оё + рап + праў 

Subjects within groups 3(n — 1) p(n — 1) оў + poi 
Within subjects 6n np(p — 1) 

A 2 p-1 оў + о, + про? 

АВ 4 (р—1(р—1) | eei, подр 

A x subjects within 


groups 6n —6 pn — Xp = 1) оў + oi, 


of the Latin-square plan is not recommended. However, if the main effects 
of factor B are also of primary interest to the experimenter, use of the Latin 
square has much to recommend it. By suitable replication, some within- 
subject information may be obtained on all components of the AB inter- 
action. Plan 5 forms a building block out of which other plans may be 
constructed. 

Computational Procedures for Plan 5. Computational procedures for 
this plan are similar to those given for Plan 1. The sources of variation 
SS,, SS,, and SSyroups are computed in a manner analogous to those used in 
Plan 1 for corresponding effects. The variation due to the within-subject 
components of the AB interaction is given by 


SS/, = 58, — SSerouns, 
where SS,, is computed from ап AB summary table by means of the usual 


computational formulas for this interaction. am 
The variation due to subjects within groups 1s given by 
EP ma) a LG, 
np i 

where Pma represents the sum of the p observations on person m in group 
k, and where G, represents the sum of the np observations in group k. 
The summation is over the whole experiment. If the summation is limited 
to a single group, one obtains SS, w. group & There will be p such sources 
of variation. It is readily shown that 


SSsubj w, groups = 228550) w. group t- 


SSsunj w. groups — 


542 LATIN SQUARES AND RELATED DESIGNS 


That is, the sum of squares on the left represents a pooling of the sources 
of variation on the right; the latter may be checked for homogeneity by 
means of an Ку test. 

It is convenient to compute SScrronwithin from SS, (1. The latter is given 
by 

=(AB,,)? 
SSy, cou = 2X" — AB) 
n 

where AB,, represents a cell total. The summation is over the whole 
experiment. From this source of variation, 


SSerror(within) = 85%, сеп — SSsunj w. groups. 
A somewhat modified computational procedure provides the parts of the 
error term that may be checked for homogeneity. One first computes 


У 2 
55%, сеп for б, == Ух? = Ану , 
п 


where the summation is limited to those cells in which group (с participates. 
The degrees of freedom for this source of variation are p(n —1) Then 


SSerror (within) for Gy = SSw. сеп for ay — SSauvjw. Gy. 
Degrees of freedom for this source of variation are pin — 1) —(n—}), 
which is equal to (n — 1)(р — 1). It is readily shown that 
SS ror (within) = ESS сок (within) for бу? 


that is, the variation on the left is a pooling of the sources of variation on 
the right. The degrees of freedom for the pooled error (within) and the 
degrees of freedom of the parts are related as follows: 


dferror (within) = а — Dp — 1) = p(n — 1)(р — 1). 


Thus error (within) is partitioned into p parts; each of the parts has 
(n — 1)(р — 1) degrees of freedom. 

Illustrative Applications. Staats et al. (1957) report an experiment which 
may be represented schematically as follows: 


а а 

Базы езана ФМ 
G, bi by 
с, by b, 


In this experiment the levels of factor A represent two different words. The 
levels of factor B represent two different kinds of meaning associated with 
the words. Subjects were assigned at random to each of the groups. The 
criterion measure was a rating of the “pleasantness” of the word after a 
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series of associations defined by the level of factor B. The analysis of 
variance had the following form: 


df MS F 
Between subjects 159 
Groups 1 0.00 
Subj w. groups 158 2.30 
Within subjects 160 
A Words 1 45.75 18.67 
В Meaning 1 22,58 9.22 
Error (within) 158 2.45 


In this design, differences between groups may be considered as part of the 
AB interaction. 

Martindale and Lowe (1959) report an experiment which is also a special 
case of Plan 5. In this case, a 5 x 5 Latin square is central to the plan. 
The levels of factor A represent different angles for a television camera; the 
levels of factor B represent trials. There were three subjects їп each of the 
groups. Subjects were required to track a target which was visible to them 
only through a television screen. The criterion was time on target. The 


analysis of variance had the following form: 


df MS F 
Between subjects Uu 
Groups 4 5,000 5.03 
Subj.w. groups 10 995 
Within subjects 60 
“A Angles — 4 15102 | 38.82 
B Trials 4 167 
Pooled error 52 389 
(AB) 12 597 
Error (w) 40 326 


1 поула ego SENE MEE: 


The authors refer to (4)' as the residual from the Latin square; the error 
(within) is called the residual within the Latin square. These latter sources 
of variation are pooled to form the within-subject error. Я 

Plan 6. In Plan 5, the groups assigned to the rows of the Latin square 
form a quasi factor. Essentially, Plan 5 may be regarded as one complete 
replication of a two-factor factorial experiment arranged in incomplete 
blocks. From this point of view, the plan to be discussed in this section 
may be regarded as a fractional replication of a three-factor factorial 


experiment arranged in incomplete blocks. 
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In this plan each subject within G, is assigned to the treatment combina- 
tions абсууу, abcss,, and аБсз. Thus each subject in G, is observed under 
all levels of factors A and B but under only one level of factor C. For each 
subject there is balance with respect to the main effects of factors А and B, 
but there is no balance with respect to any of the interactions. Similarly, 
each subject in G, is assigned to the treatment combinations abciso, abCo49, 


| а а аз 
Ga ШЕЛ bs by 
G, а | ba b, by 
Gy Wer | қ bs b, 


and абсзз. Again there is balance with respect to the main effects of 
factors A and B, but there is no balance with respect to interactions. 


Table 10.7-3 Analysis of Plan 6 


Source of variation df df for general case E(MS) 

Between subjects 3n —1 пр -1 

F 2 р-1 92 + po? + проф 

Subjects within groups 3(n — 1) p(n — 1) оў + po? 
Within subjects 6n np(p — 1) 

A 2 р-і оў + npo? 

B 2 p-1 оё + про) 

Residual 2 (p — (р — 2) о? + Noes 

Error (within) бп — 6 pi — 1)(р — 1) oè 


, If all interactions are negligible relative to main effects (a highly restric- 
tive assumption), the following model is appropriate for making estimates 
and tests in the analysis of variance: 


E(X sem) = U + yy + 750) + tit) + Bj) + rese. 
The analysis of variance and the expected values of the mean squares are 
summarized in Table 10.7-3. 

The analysis of Plan 6 is quite similar to the analysis of Plan 5. Dif- 
ferences among groups in the latter plan Correspond to differences due to 
the main effects of factor C in the present plan; what was part of the AB 
interaction in Plan 5 is a residual term in the present plan. It is of interest 
to note the relationship between the analysis of a two-factor factorial 
experiment which does not have repeated measures and the analysis of 
Plan 6. Тһе total number of observations in Plan 6 is the same as that 
of a two-factor factorial experiment in which there are л observations рег 
cell. The Latin-square arrangement in Plan 6 permits the partition of 
what is formally a two-factor interaction into a main effect and a residual 
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effect. The latter is a mixture of interaction effects if these exist. The 
fact that there are repeated measures in Plan 6 permits the within-cell 
variation to be partitioned into one part involving differences between 
subjects and one part which does not involve differences between subjects, 
provided of course that homogeneity of covariances exists. 


df Two-factor experiment | Р!ап 6 df 

SENI A | A | poi 

ip B B Teal! 

C p-! 

(р = 00р = 1) АВ | 
T | (Residual (p - 10р = 2) 
| Subj w. gp pn — 1) 
p(n — 1) Within cell | | 

Error (within) ра = Xp = 1) 


Because of the highly restrictive assumptions with respect to the inter- 
actions, use of Plan 6 is appropriate only when experience has shown that 
such interactions are negligible. A partial check on this assumption is 
given by the ratio 

MSres 


EN MSerror(within) i 
According to the model under which the analysis of variance is made, Ore 


should be zero when the assumptions with respect to the interactions are 
satisfied. In cases in which one of the factors, say, A, is the experimental 


F 


Table 10.7-4 Complete Factorial Analogue of Plan 6 


a, аҙ аҙ a, аз аз а 


variable of primary interest and factors В and С аге of the nature of con- 
trol factors (i.e., replications or order of presentation) this plan is poten- 
tially quite useful. With this type of experimental design interactions can 
frequently be assumed to be negligible relative to main effects. me 

In exploratory studies in which interaction effects may be o ы 
interest to the experimenter, there is generally no substitute for the complete 
factorial experiment. The complete factorial analogue of Plan 6 is repre- 
sented schematically in Table 10.7-4. Іп order to have complete within- 
subject information on the main effects of factors A and B as well as com- 
plete within-subject information on all interactions, including interactions 
with factor C, p? observations on each subject are required. In Plan 6, 
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only p observations are made on each of the subjects. Use of Plan 6 
reduces the over-all cost of the experiment in terms of experimental effort— 
at the possible cost, however, of inconclusive results. Should the experi- 
menter find evidence of interaction effects which invalidate the analysis of 
Plan 6, he may, if his work is planned properly, enlarge the experiment 
into Plan 8а. It will be noted that the latter may be constructed from a 
series of plans having the form of Plan 6. The assumptions in Plan 8a 
are less restrictive with respect to interactions than are those underlying 
Plan 6. 

Computational Procedures for Plan 6. Computational procedures for 
this plan are identical to those outlined for Plan 5. Here SS, = SS. сше 
and 55), = $$. All other sums of squares are identical. Tests for 
homogeneity of error variance are also identical. 

Plan 7. This plan is related to Plan 5 as well as to Plan 6. Plan 7 may 
be regarded as being formed from Plan 5 by superimposing an orthogonal 
Latin square. This plan may also be viewed as a modification of Plan 6 
in which the C factor is converted into a within-subject effect. The com- 
binations of factors B and C which appear in the cells of the following plan 
are defined by a Greco-Latin square: 


ау а аз 
с, bey bes, bcag 
Go Әсә bes bci 
Gz besg ӛсіз bes, 


The groups of subjects are assigned at random to the rows of the square; 
the levels of factor 4 are also assigned at random to the columns of the 
square. The subjects within G, are observed under treatment combina- 
tions abc, аБсз»з, and абсззз. (Assume that the order in which a subject 
is observed under a particular treatment combination is randomized in- 
dependently for each Subject.) Provided that all interactions are negli- 
gible, unbiased estimates of the differential main effects of factors A, B, 
and C can be obtained, Further, the expected values of the mean squares 
for these main effects will not involve a between-subject component. The 
model under which the analysis of Plan 7 can be carried out is 


E(X inmo) = n + бу + ту + а) + В + Vri- 


The symbol б, designates the effect of group т, and the symbol тм) 
designates the additive effect of person о within group т. 

_An outline of the analysis, assuming that the model is appropriate, is 
given іп Table 10.7-5. Fora3 x 3 square, the variation due to the residual 
cannot be estimated, since there are zero degrees of freedom for this source 
of variation. However, for Squares of higher dimension a residual term 
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will be estimable. The latter represents the variation due to interactions, 
if these exist. The variation between groups may also be regarded as part 
of the interaction variation. 

If the levels of factor C define the order in which subjects are observed 
under combinations of factors A and B, then Plan 7 becomes a special case 


Table 10.7-5 Analysis of Plan 7 


Source of variation df df for general case E(MS) 

Between subjects sn np —1 

Groups 2 Р'—1 оў + po? + про) 

Subjects within groups | 3(» — 1) рт — 1) оў po 
Within subjects 6n np(p — 1) 

A 2 p-1 0% + прод 

В 2 p-1 оў + проў 

С 2 m o? + про? 

Residual 0 (p—1Xp —3) | o? + nors 

Error (within) 6n —6 po — 1)(р — 1) оз 


of Plan 5. In terms of factor С as ап order factor, the schematic repre- 
sentation of Plan 7 given earlier in this section may be reorganized as 


follows: 
Order 1 Order2 Order 3 


б, aby, abgg аба 
Gs аба abis абу 
Gs by, аба абз 


Thus the subjects within G, are observed under treatment combinations 
аб, аБзз, and abs, in that order. With one of the factors representing an 
order factor, the variation between groups may be regarded as representing 
a sequence effect; this latter source of variation may also be regarded as part 
of the confounded interaction effects. қ 

Computational Procedures for Plan7. Тһе computational procedures for 
this plan do not differ appreciably from those of the Greco-Latin square. 
Following the computational procedures for the latter, one obtains SS,, 
55,55; апа 58 ш- Lhe between-cell variation is given by 


55 e X(celltota? — G? р 
between cells n np? 
From this sum of squares one obtains 


SSres = SSpetween cells — (SS, ЕР 55, T SS, m SSaroups)- 
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From this last relationship, the degrees of freedom for the residual source of 
variation are 


dfres = (р? — 1) — 4p — 1) = (p — Dp + 1 — 4) 
= (p — 1/р — 3). 
The variation due to subjects within groups is 


2 2 
SSsubj w. groups — >Р € an > 
р пр 
where Pm is the sum of the p observations on person m in group k, and 
where б, is the sum of the пр observations in group k. Тһе error (within) 
variation is obtained indirectly from the within-cell variation. The latter 
is given by 


X(cell total)? 
PES RED 


SSw. сеп = УХ? — 


where the summation is over the whole experiment. From this one obtains 
SSerror(witnin) Vd SSw. cell — SSsup; w. groups: 


This method of computing this source of variation shows that the degrees 
of freedom for error (within) are 


dferror(within) = p(n = 1) rat p(n = 1) 
= p(n — 1)(р — 1). 

The sum of squares for error (within) may be partitioned by the pro- 
cedures given under Plan 5. That is, by limiting the summations for 
SS, cons апа 85, w. groups tO Single groups, one may obtain p terms each 
of the general form SSerror (within) for съ: The degrees of freedom for each 
of these terms are (n — 1)(р — 1). 

Illustrative Application, Leary (1958) reports a study using a plan closely 


related to Plan 7. This experiment may be represented schematically as 
follows: 


Square Y Square M 
Day Day 
Subject .———— — Subject 
1 2 3 4 1 2 3 4 


1 (П) Q2) (33) (44) 5 (33) (41) (14) Q2 

2 (23) (4) (4) (32) 6 (12) (24) (31) (43) 

3 (34) (43) (12) (21) 7 (21) (13) (42) (34) 

^ |UD (31) (24) (13) 8 | 44) (32) Q3 (1) 
кесе Se ae 


The basic plan consists of two different Greco-Latin squares. The symbol 
(23), for example, represents an experimental condition having reward level 
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2 for list 3—a list being a set of pairs of objects. The analysis of variance 
for this experiment had the following form: 


Between subjects 


Squares 
Subjects within squares 


IR aca 


Within subjects 

Days 

Lists 

Rewards 

Pooled residual 
Days x squares 
Lists x squares 
Rewards x squares 
Res from square I 
Res from square II 


Awww 


ч чы ко C2 G2 


In this design there is only one subject per cell, but there are two 
replications. Since the squares are considered to define a random factor, 
the interactions with the squares are pooled to form part of the experi- 
mental error. The residual within square I is formally equivalent to the 
interaction between any two of the dimensions of square I minus the sum 
of the main effects of the other factors in the square. Thus, 


SSiestrom t == SSaays xlists — (SSrewaras + SSsunj м. р). 


In each case the summations are limited to square І. This residual term is 
an estimate of experimental error if the assumptions underlying the model 
are valid. Similarly the residual from square IL is an estimate of the experi- 
mental error. An Ру test may be used as a partial check on the pooling 


of the various estimates of experimental error. 
In terms of the analysis outlined in Table 10.7-5, the degrees of freedom 


for the pooled residual in this design are 
(p — 1)(р — 3) + p(n — 1)(р — 1) = GXI) + Җ1)(3) = 15, 


where n in this case is the number of replications. A и 
Plan 8. This plan uses Plan 5 as a building block. Disregarding the 


repeated-measure aspect, this plan also resembles Plan 2. А schematic 
representation of Plan 8 is given below: 


I II 


а а аз а а а 

с b. b. b; б, by by ы 

«| ep pones чене c. у 
G, b, b. bs 

б; bs ы by 6 з 1 2 
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In general there will be q squares, one for each level of factor C. Different 
Squares are used for each level of factor C. In square I, all observations are 
at level сі; in square П, all observations аге at level су. There is no restric- 
tion on the number of levels of factor C. 

Depending upon what can be assumed about the interactions, different 
analyses are possible. If the interactions between factors A , B, and groups 
(the dimensions of the squares) are negligible, then unbiased tests can be 
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Table 10.7-6 Analysis of Plan 8 
ee ыы 


Source of variation df 9 гоа E(MS) 
Between subjects бл —1 npq — 1 
с 1 4—1 02 + po + проў + npo 
Groups within C 4 «(p — 1) o? + poi + проў 
Subjects within groups | 6(n — 1) pan — 1) оў + poi 
Within subjects 12n тра(р — 1) 
A 2 p-1 oè + npqoà 
B 2 р-і о? + прдо3 
AC 2 (р — 1g – 1) | о + про? 
BC 2 (p —1)$ — 1) | e? + прод, 
Residual 4 «(p - 1)(р — 2) | o? + по?,, 
Error (within) 12n - 12 | pq(n — Dp-n|c 


made on all main effects and the two-factor interactions withfactor C. The 
model under which the latter analysis is made is the following: 


ЕХ) = B + Ye + бу + Totem + «уу + буш + aya + BY in: 


The analysis outlined in Table 10.7-6 follows from this model; the expected 
values for the mean Squares assume that factor C is fixed. It is also 
assumed that the order of administration of the treatment combinations is 
randomized independently for each subject. 

The assumptions underlying Plan 8, which lead to the analysis sum- 
marized in Table 10.7-6, are different from the assumptions used in the 
analysis of Plan 5. A set of assumptions, consistent with those in Plan 5, 
leads to the analysis for Plan 8. Тһе latter plan has the same schematic 
representation as Plan 8. However, in Plan 8 only the interactions in- 
volving the group factor are assumed to be negligible. Under these latter 
assumptions, the appropriate model is the following: 


ЕХ.) = u + 0, + Tolem) F % + Bs + efi, + оу, + Вул- 


The analysis resulting from this model, assuming factors А and В fixed, is 
outlined in Table 10.7-7. 
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Table 10.7-7 Analysis of Plan 8 


5 f variati df for general 
ource of variation df Cans E(MS) 
Between subjects бл —1 mpq — 1 T - Ж 
е 1 4-1 9; + po? + про) + пра? 
Groups within C 4 4р = 1) оў + ро? + проў 
Subjects within groups | 6(n — 1) pan — 1) оў + ро? 
Within subjects 12n пра(р — 1) 
А 2 p-! в} + npqoz 
B 2 р-і 9? + npqoj 
AC 2 (p -1)4-1) | of + прод, 
BC 2 (p = 04 —1) | e + пров, 
AB' from square I 2 (p = p —2) | e? + но; 
AB' from square II 2 (p — Vp —2) | e? + nois 


Error (within) 12n - 12 | pq(n -- (p — 1) оў 


If the number of levels of factor C is equal to p, and if a balanced set of 
squares is used, then the components of the AB interaction which are con- 
founded will be balanced. However, if the same Latin square is used 
for each of the levels of factor C, the same set of p — | degrees of freedom 
of the AB interaction is confounded in each square. It is of interest to 
compare the partition of the total degrees of freedom in a 3 x 3 x 3 


Table 10.7-8 Partitions in Plan 8a and 3 x 3 x 3 Factorial Experiment 


3 x 3 x 3 factorial Plan 8a 
df Source of variation Source of variation df 
2 A A 2 
2 B B 2 
2 (е) с 2 
4 АС АС 4 
4 ВС ВС 4 
4 АВ Groups within C 6 
8 ABC X(AB' for each square) 5 P 
ithi Subjects within groups л — 
ы) Within cell Error (within) 18(n — 1) 
01.0 1 оиша п бараана 


factorial experiment with the partition made under Plan 8a, assuming that 
a balanced set of squares is used in the construction of the design. These 


partitions are presented in Table 10.7-8. | ? 
The 12 degrees of freedom corresponding to the AB and ABC inter- 


actions in the factorial experiment appear in Plan 8a as the sum of 6 degrees 
6 degrees of freedom for what is 


of freedom for groups within C and 
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partial information on the simple interaction of AB at each level of су. In 
the general case, the relationship is as follows: 


df df 
(p — 1% АВ) {Groups within C pp — 1) 
(p — 18 ABC) \У(АВ' for each square) Рр - 10р — 2) 
Рр — 1) pp — 1) 


In order to have complete within-subject information on all effects, 
except the main effects of factor С, р? observations on each subject are 
required. In Plan 8a (constructed with a balanced set of squares) only p 


Table 10.7-9 Summary of Computational Procedures for Plan 8 
= ————==——....... 


(1) = СҮтрч (6) = [5(АСд°}лр 

0) = хл? (1) = 2BCy,)*I/np 
© 0) = (ЕАфулрд (8) = ХР 

(4) = (®В?упрд (9) = [X(cell total)?]/n 

© = ECD (10) = (263,)/npq 


(In each case the summation is over the whole experiment.) 


SSpetween subjects = (8) — (1) 
SS, = (5) -(1) 
SSproups м.С = (10) ly (5) 
SSsupj w. groups = (8) — (10) 


55, subj = (2) — (8) 
Gi) SS, = (3) — (1) 
SS, = (4) — (1) 
55,, = (6) - (3) - (5) + (1) 
55,, = (7) — (4) — (5) + (1) 
SSres = (9) - (10) — (6) - (7) + 2(5) 


SSerror (within) = (2) — (9) — (8) + (10) 


observations are required on each subject. What is lost in reducing 
the number of observations per subject from p? to p is partial information 
on the AB and ABC interactions, specifically p(p — 1) of these degrees of 
freedom. This loss in information is, in many cases, a small price to pay 
for the potential saving in experimental feasibility and effort. d 

, In Plan 8a, the source of variation SS;;, which is actually part of the 
simple AB interaction for each of the levels of factor C, may be tested 
separately or as a sum. Asa sum, this source of variation is a mixture 
of the AB and ABC interactions, 

Computational Procedures Jor Plan 8. Computational procedures for 

this plan are summarized in Table 10.7-9. These same procedures may also 
be followed for Plan 8a. In the latter plan, 55, becomes XSS;, for each 
square. 
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In part i of this table symbols convenient for use in the computational 
formulas are defined. Computational formulas are given in partii. The 
order in which the sums of squares are given in part ii corresponds to the 
order in Table 10.7-6. For purposes of checking homogeneity of the parts 
Of SS.rror(withiny» this source of variation may be partitioned into рф non- 
overlapping parts—one part for each of the pg groups in the experiment, 
The part corresponding to G,, is 


SSerror(within) гога, = (28m) — 98m) — (88m) + (102), 


where (2g,,) designates the numerical value of (2) if the summation in the 
latter is restricted to G,,. Analogous definitions hold for the other com- 
putational symbols containing gm. Itis readily shown that 


SSerror(within) е ESSerror(within) for б. 


The Аах Statistic may be used to check on the homogeneity of the parts that 
are pooled to provide the over-all error term for the within-subject effects. 


It is readily verified that 
X(28m) = (2),  X0g,-—() «іс. 


The variation due to subjects within groups may also be checked for 
homogeneity. The sum of squares 55, w. groups may be partitioned into 
pq parts, one part for each group. Each part has the general form 


SSsubjw.¢,, = (Bgm) — (10gm), 


where (8g,,) is the numerical value of (8) if the summation is restricted to 
group m. А 

If а balanced set of squares is used іп the construction of Plan 8а, the 
computational procedures may be simplified. One may use the regular 
computational procedures for a three-factor factorial experiment having 
п observations per cell as the starting point. In addition one computes 
SSyrouns w. c and 56, w. groups by means of the procedures given for Plan 8. 
Then 

Х(55), for each square) тт SS. F SSave к=, SSgroups м. Cy 


SSerror(within) = SSw. сеп SSsubj w. groups. 


Illustrative Application. Conrad (1958) reports an experiment that may 
be considered a special case of Plan 8. The purpose of this experiment was 
to investigate the accuracy with which subjects performed under four 
methods for dialing telephone numbers. There were 24 subjects in the 
experiment. Rather than assigning groups of 6 subjects to the rows of a 
single 4 x 4 square, individual subjects were assigned at random to six 
different 4 x 4 squares. The squares thus represented different sets of 
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sequences under which the subjects used the methods. The analysis of 
variance for the Conrad data was as follows: 


аш | MS | ЕЖ 

Between subjects 23. | | 
C Squares 5 2,446 

Subj w. squares 18 28,651 
Within subjects — | 72 
A Methods 3 3,018 | 9.61 
B Order SAN TOS 3.52 
AC | 45 197 
BC jer 215 215 
Residual 36 314 


Plan 9. This plan may be viewed as a special case of Plan 8a in which 
the same square is used for all levels of factor C. Hence the same com- 
ponents of the AB interaction are confounded within each of the squares, 
and the same components of the ABC interaction are confounded in dif- 
ferences between squares. Full information will be available on some 
components of both the АВ and ABC interactions. The number of levels 
of factors A and B must be equal; there is no restriction on the number of 
levels of factor C. Factors A, B, and C are considered to be fixed; inter- 
actions with the group factor are assumed to be negligible. A schematic 


representation of this plan for the case in which р = 3 and q = 3 is given 
below: 


а Ф% 4 а, аа а [a аз az 

G,| b. bs b cll КАЕ? 

а | Ga| bi by b, ca | G; | by be bg C | Ga | bı ba bs 
G,| by b, b, GP ы 3; | Go | by by by 


It is of considerable interest to indicate the manner in which the AB 
and ABC interactions are confounded. A given row in each of the above 
Squares represents the same combination of treatments A and B. For 
example, the first row in each square involves абу, abs, and абу. A 
summary table of the following form will be shown to have special mean- 
ing in the interpretation of the AB and ABC interactions: 


Row сі Cy са Total 
1 С, с, с, R, 
2 б, G; Gs Ry 
3 б; б; бұ Кз 
С, с, Cs Grand total 


—————— — — —ÁÉÁ—— 
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In this summary table each G designates the sum of the np observations 
made within a group. 

The total R, represents the sum of the npq observations in row 1 of all 
squares; this sumis also С, + б, + б). This totalis balanced with respect 
to the main effects of factors A, B, and C; it is also balanced with respect to 
the ABC interaction. The latter balance follows from the fact that sums 
of the form 

«Ву + «Ву» + «буз = 0, 
ауаз + оВузз + ®Вуззз = 0. 


However, the total R, is not balanced with respect to the AB interaction, 
since 
abia + аф + аа 5 0. 


By similar reasoning each of the other R’s may be shown to be balanced 
with respect to all main effects as well as the ABC interaction; however, 
each of the other R’s is not balanced with respect to the AB interaction. 
Thus, variation due to differences between the R’s represents two of the 
four degrees of freedom of the AB interaction. In general such differences 
represent p — 1 of the (p — 1)” degrees of freedom of the AB interaction. 

Since differences between the rows define p — 1 degrees of freedom 
of the AB interaction, the row x C interaction will define (p — 1)( — 1) 
degrees of freedom of the AB x C interaction. The latter interaction 
is equivalent to the ABC interaction. Hence (p — 1)(9 — 1) of the 
(p — 1)(р — 1)(q — 1) degrees of freedom for the ABC interaction are 
confounded with row effects. The remaining degrees of freedom are not 
confounded with row effects; that is, (p — 1)(p — 2)(q — 1) degrees of 
freedom of the ABC interaction are within-subject effects. 

In some texts the following notation has been used. 


SSrows = SSav(netween)s 
SSrow x с = SSabetbetween). 
In terms of this notation, 
55, = SSad(between) + SSan(within)- 
The corresponding degrees of freedom are 
(p — 1)(р — 1) = (p — D + (p — 1)(р— 2). 
The three-factor interaction takes the following form: 


55, = SSabecvetweeny + SSabecwithin), 
(p — Dp — 1)(4 — 1) = (p — 1)(4 — 1) + (p — DP — 2X4 — 1). 


In each case the "between" component is part of the between-subject 
variation, and the “within” component is part of the within-subject effects. 
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The model under which the analysis of variance for this plan may be 
carried out is 


E(X seme) = + уь + (гом), + (y X гоз) + Tolm) + di + bi + а); 

+ «у А буу + «Ву 
The analysis corresponding to this model is summarized in Table 10.7-10, 
The expected values of the mean squares in this table are derived under the 
assumptions that factors А, В, and С are fixed; the group factor and 


subjects within groups are considered random. АП interactions with the 
group and subject effects are considered negligible. 


Table 10.7-10 Analysis of Plan 9 
Кл те EE На — 


Source of variation df df for general case E(MS) 
Between subjects 9n —1 npg — 1 
с 2 4-1 а? + ро? + npo? 
Rows [AB (between)] 2 p-1 оў + po? + ngage 
C x row [ABC (between)] 4 (р = Iq – 1) оў + po? + поз 
Subjects w. groups 9(n — 1) pan — 1) оў + ро? 
Within subjects 18n пра(р — 1) 
А 2 р-і of + прдод 
B 2 р-і о? + прдоў 
AC 4 (p—1Xq—1) |92 + проз, 
BC 4 (p-1Xq—1 |o? + пров, 
(AB) 2 (p — 10р —2) og + пдоар 
(ABC) 4 (p — (p —2)q — 1) |o? + nods, 
Error (within) 18n — 18 рар — 1)(л — 1) оў 


O OOOO EE eee eee е ұ 


The analysis given in this table indicates that Plan 9 may be considered 
as a special case of a p x p x q factorial experiment arranged in blocks 
of size p, the groups having the role of blocks. The differences between 
the groups may be partitioned as follows: 


Between groups ра – 1 
С Gio 
Groups within C 4р — 1) 
Rows pid 
C x rows (р — 04-1) 


What corresponds to the within-cell variation in a p x p x q factorial 
experiment having n observations per cell is partitioned as follows in Plan 9. 


Within cell pq — 1) 
Subjects w. groups pain — 1) 


Error (within) pga — Vp — 1) 


1 
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Numerical Example of Plan9. Suppose that a research worker is inter- 
ested in evaluating the relative effectiveness of three variables in the design 
of a package. The variables to be studied are kind of material, style of 
printing to be used, and color of the material. Each of these variables con- 
stitutes a factor; there are three levels under each factor. That is, there are 
three kinds of material, three styles of printing, and three colors. A total 
of 3 x 3 x 3 = 27 different packages can be constructed. Suppose that 
all 27 packages are constructed. 

If the research worker selects Plan 9 for use, each subject is required to 
judge only 3 of the 27 packages. Under the conditions of the study, this 
number is considered to be the largest number feasible for any one subject. 
One of the main effects in Plan 9 (that of factor C) is wholly a between- 
subject effect, but within-subject information is available with respect to 
the interactions with that factor. Anticipating interactions with the color 
factor, and desiring within-subject information on interactions with the 
color factor, the research worker decides to let color correspond to factor C 
inPlan9. This plan is symmetrical with respect to factors A and B; hence 
there is no difference whether the material factor corresponds to factor 
A or B. Suppose that the material factor is made to correspond to 
factor A. 

The number of subjects to be used in each of the groups depends upon the 
precision that is desired. In related studies of this kind a minimum of 10 to 
20 subjects per group is generally required to obtain a satisfactory power. 
Depending upon the degree of variability of the judgments, a larger or 
smaller number of subjects will be required. For purposes of illustrating 
the computational procedures, suppose that there are only two subjects in 
each group. The basic observational data for this illustration are given in 
Table 10.7-11. ‘The design is that given earlier in this section; the data have 
been rearranged for computational convenience. 

The subjects are asked to rate the packages assigned to them on a 15-point 
criterion scale. Subjects are assigned at random to the groups; the order in 
Which a subject judges a set of packages is randomized independently for 
each subject. Subjects in groups 1, 4, and 7 each judge combinations aby», 
аб,ҙ, and аб. However, these combinations appear in a different color 
(factor C) for each of the three groups. Personl made the ratings 2, 2, and 
3 for the respective combinations, all of which are at color су. Person 6 
made the ratings 7, 5, and 9 for the respective combinations—these com- 
binations are in color сз. The assignment of the treatment combinations 
to the groups follows the schematic representation of Plan 9 given earlier 
іп this section. For ease in computation the data in Table 10.7-11 have 
been rearranged. In addition to the basic observed data, totals needed 
in the analysis of variance are given at the right. 

Summary data obtained from Table 10.7-11 appear in parts i and ii of 
Table10.7-12. Ап ABCsummary table appearsin parti. Entriesinparti 


558 


LATIN SQUARES AND RELATED DESIGNS 
Table 10.7-11 Numerical Example of Plan 9 


Person Group Row 
Group Person a а аз total total total 
b b b 
1 e ag) ПИР; 
а a 2 15 1 256 9 16 = б, 
3 sv EGO ati уд 
fi в 4 ОДЕЛИ NEST 28 42 
5 19,4. б 1015 
5 б 6 de ul et 36 94 = Ry 
OPTERON 5; 
7 Lae earl cals 
а e 8 Rs i5 Ara aol 37 
9 ge 222 
а Gs 10 10 14 6 |'30 52 
11 ТОЙО ЕВ МОВ 
9 M 12 TES CR Р 47 136 
bs b b 
13 ЕТІСТІГІН ГТ 
e Gs 14 6:45), |219 29 
15 ОҚЫ РАН ЕЗ 
cvs LC 16 Omen a 225 48 
17 Wee steele 
| 6 18 Sim S912) xus 46 123 
Total 14 117 12 353-G 353 353 


have the general symbol АВС, and are each based upon two observa- 
tions. For example, the entry АВС, is the sum of all observations made 
under treatment combination abe,,,. Only subjects 7 and 8 are observed 
under this treatment combination. Hence 


АВС = 5 +8 = 13. 


As another example, only subjects 11 and 12 are observed under treatment 
combination abc333; hence 


АВС = 8 + 9 = 17. 
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Table 10.7-12 Summary Data for Numerical Example 


сі | Са с 
a Total 
a az az a az аз 41 а аз 
bi 13 6 10 18 19 8 17 14 15 120 
(i) b, 3 9 14 14 24 1 12 13 12 112 
bs 9 3 15 18 20 10 20 9 17 | 121 
| 353. 
| а а, a, Total a, а, аҙ Total 
bi 48 39. Зз 50 су 25 18% 39 82 
bg 29 46 Зе Ca 50: , 62 29 142 
by | 47 32--42- 3121 C3 49 753608244. .129. 
Total| 124 117 112 353 Total| 24 117 112 353 
(ii) 
Row Row Row Group 
b, bs b, Total 1 2 3 total 
e 29- 1.269821 82 сі ө sim, 29 82 
с, 45 49 48 142 Cy 42 52 48 142 
C3 46 37: 46262129 сз 36 47 4 129 
Total| 120 112 121 353 94 136 123 353 
(1) = С?/пр?д = 2307.57 
(2) = ХХ? = 2811 
(3) = (£4?)/npq = 2311.61 
(4) = (2B})/npq = 2310.28 
(5) = (УС тра = 2418.28 
(iii) (6) = [E(AB;)]/ng = 2372.83 


(7) = [E(AC)"]/np = 2568.83 
(8) = [E(BC;)?]/np = 2429.50 
(9) = [X(ABC;4]/n = 2654.50 


(10) = (2P2)/p = 2575.00 
(11) = (262)/лр = 2476.50 
(12) = (5: 8%)/прд = 2358.94 


NS 


From the data in part i, АВ, AC, and BC summary tables may be pre- 
pared; these appear in part ii. Data for the C x row summary table in 
part ii are obtained from the column headed Group total in Table 10.7-1 1. 
Computational symbols convenient for use in obtaining the required sums 
of squares are defined in part iii. The numerical values of these symbols 
for the data are also given. 

The analysis of variance is summarized in Table 10.7-13. Factors A, 
В, and С are considered to be fixed. Tests on the between-subject effects 


TTE st | 008$ (D + (9 — 00 — © (шцим) опа 
7 v 60'6 ГО) + CD — (9 — (DI — 
KD — (G) + Ф) + ©) + (8) — (0) — (9) — (91 28V 
ісе с YUL KD = (GDI — KD + Ф) — © — (9) HV 
tre m2 168 (0 + () — Ф) — (9) 29 
xx8t II £9'9€ v ІСОРІ (SE (өзе еле 2), OV 
9€T c ш (0 = Ф (Запшиа) g 
20% T tot (prse) (Penau) y 
9€ 00'9£c (0 — © sioofqns uM e 
?601 | 6 05:86 (D — (0D Япо18 ung syoofqns = 
ILI Y 589 (0 + €D — (9 – (1 MOL х 2 
SET 89°ST га Lets (0 = (D SMON 
«905 9655 © шоп (0- (9 . . (1002) 2 
LI егі9с (D — (0D 8129148 uoojog 
d SIN JP SS v[nurioj euonryndwop UOHPIIVA JO 951nog 
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use MS.un; w. groups in the denominator of F ratios. Tests оп within-sub- 
ject effects use MSerrorwithiny: It will be noted that the denominator for 
the within-subject effects (3.22) is considerably smaller than the denomina- 
tor for the between-subject effects (10.94). 


Table 10.7-14 Analysis of Simple Effects for Factor A 


—Ó——M—— EROR (ег3е ТРИ 


Source of variation | Computational formula SS df MS | F 
А at level сү 0) — (1e) 3811.122 19.06 5.92* 
A at level су (369) — (1с) 98.1 | 2| 49.06 | 1524** 
A at level сз (3c) — (1сз) 14.33 | 2 7.16 2.22 


Error (within) (From Table 12.7-13) | 150.55 | 18 | 3.22 


The F tests indicate a highly significant AC interaction. To help inter- 
pret this interaction effect, the profiles for simple main effects of factor A 
(materials) at the various levels of factor C (colors) are given in Fig. 10.1. 
These profiles were prepared from the AC summary table in part ii of 
Table 10.7-12. Although color cy 
has the highest over-all average rat- 70 
ing, c; in combination with material аз 


has one of the lower average ratings. A 
te 720 


Material аз іп combination with color 60 
Са has the highest average rating. 
An analysis of the simple main effects / 
of factor A for the levels of factor С Бо A. Siea 
will permit statistical tests of the в K 
differences among the set of points р NG 
in the same profile. These tests = Ў 
аге summarized in Table, 10.7-14. 
Differences between the points on the 
profile for с, are statistically signifi- 
cant (F = 15.24). A test on the 
difference between a, and a; at level 
€; is given by 
= „Аби = AC 
2np MSerror(within) 
Gd HT =. 
2(6)(3.22) 


For a .05-level test, the critical value for this last test is F5,(1,18) = 4.41. 
Hence the experimental data indicate a difference in favor of material аз, 
but this difference is not quite large enough to be considered statistically 
significant. If the costs of materials a, and a; are equal, the experimental 


Figure 10.1 
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data indicate that a, is to be preferred to a, when color c; is used. How- 
ever, if there are differential costs, the decision must take such cost differ- 
entials into consideration. 

Tests on simple main effects are generally made along the dimension of 
greatest interest to the experimenter. This dimension, by design, is most 
frequently the within-subject dimension. However, should the experi- 
menter desire to make tests on the simple main effects of what corresponds 
to factor C, the denominator of such tests is not MS,,;,, groups) Rather 
the denominator is MS,, «n for the appropriate level of the simple main 


Table 10.7-15 Analysis of Variance for Michels et al. Experiment 


Source df MS F 
Between subjects 26 
Sequences (rows) 2 3.05 
C (replications) 8 5.71 
Sequence x reps 16 7.59 
Within subjects 24 
A (order) 2 56.64 5.12 
В (problem) 2 417.64 37.73 
Pooled error 50 11.07 
Order x reps 16 12.89 
Problem x reps 16 9.43 
Residuals within 
squares 18 10.90 


————— 0. 


effect. For example, in making tests on the simple main effects of factor 
C at level аз, the appropriate denominator is MS, cell(fora,)» Computa- 
tionally, 
(2a) — (9a) 

ра(п — 1) 


Illustrative Application. А study reported by Michels, Bevan, and Stras- 
sel (1958) represents a special case of Plan 9. Individual animals were 
assigned at random to the rows ofa3 x 3 Latin square. Problems (factor 
A) were assigned to the columns of the square; the order in which the 
animals were assigned to the problems was determined by the levels of 
factor B. Nine replications (factor C) of the experiment were run; the 
same Latin square was used throughout. The criterion was the number 
of trials required to learn the problems. 

Since the same square was used in each of the replications, corresponding 
rows of the squares represent a sequence effect. The experiment was 
actually conducted in distinct replications. Factor C, the replication 
factor, was considered to be a random factor. The analysis of variance 


MS. ceu(tor as) = 
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reported by the authors is summarized in Table 10.7-15. Since the inter- 
actions with the replication factor were homogeneous and did not differ 
from the residuals within the squares, a single pooled error term was used 
in making all within-subject tests. 

Plan 10. This plan is essentially an extension of Plan 6; more explicitly, 
Plan 6is the building block from which Plan 10 is constructed. The latter 
is a series of such building blocks, each at a different level of factor D. 
Alternatively, Plan 10 may be viewed as a fractional replication of a 
p хр X p X q factorial experiment. Asan illustration, a3 x 3 x 3 x 2 
factorial experiment will be used. A schematic representation of this plan 
is given below: 


а а, а; а а 4 

б, с | by by by б, | cı | by by b, 

di | Gy | c | by by b, d| Gs | ce | by by by 
б ca | by by Ge | | ba b 


As shown above, a different square is used for the two levels of factor D. 
In cases where partial information on confounded interactions is to be 
recovered, it is desirable to use the same square for each level of factor D. 

The complete factorial experiment for the illustrative example includes 
3 x 3 x 3 x 2 = 54 treatment combinations. Only 18 treatment com- 
binations appear in the above plan (i.e., one third of the total). If the 
factors forming the dimensions of the square (factors A, B, and C) do 
not interact with each other (these dimensions may, however, interact 
with factor D), then an outline of the analysis of variance is given in 
Table 10.7-16. The expected values given in this table are derived under 


Table 10.7-16 Analysis of Plan 10 


Source of variation df df for general case E(MS) 
Between subjects 6n —1 npq —1 
C(AB)  . 2 р-! оў} + ро? + npqos 
5 І (езі аё + pod + пр) 
CD (AB' x D) 2 (p—1(q—1 | 98 + роз + про, 
Subjects within groups | 6(n — 1) pgn = 1) a} + ро? 
Within subjects 12n npg(p - 1) 
S mia n z p-l о% + лрфой 
В 2 p-1 a? + прфоў 
Ар 2 (p -1Х4- 1) оё + проф, 
BD 2 (p = 143—1) 9; + проф, 
(AB 4 2 (p — 0р ~ 2) of + пдозд 
(ABY x pļresiduat [ (p = Wp -Dq — 1) | o? + nods, 
Error (within) . |12n —12|  pq(n — Xp —1) o 
SEBO Cin ЛТ qot; E "узы Тул! жы NAT 907 пре NA 
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the assumption that factors A, B, C, and D are fixed. Groups and 
subjects within groups are considered random. 

If, for example, the 4B interaction were not negligible, then the main 
effects of factor C and the CD interaction would be confounded with this 
interaction. A partial check on the assumption that dimensions of the 
Latin square do not interact with each other is provided by the F ratio 


Ps MSres 


MSerror(within) Ў 


A somewhat better check on these assumptions is obtained by computing 
the residual and error (within) separately for each level of factor D as well 
as the corresponding pooled terms. Individual tests are made for the 
separate levels as well as for the pooled terms. 

Relative to what would be the case in a complete factorial experiment, the 
within-cell variation is partitioned as follows: 


Within cell Pn - 1) 
Subjects w. groups рап —1) 
Error (within) pan — 1)(р — 1) 


All other sources of variation are part of the between-cell variation; the 
degrees of freedom for the latter are pg = 1: 

Plan 10 is useful in situations illustrated by the following example: 
Suppose that a research worker is interested in evaluating the effective- 
ness of q different methods of training (factor D) on marksmanship. 
There are p different kinds of targets (factor B) to be used in the evalua- 
tion process. Each target is to be placed at p different distances (factor 
C). Subjects are to fire at each of the P targets, but subjects are assigned 
to only one distance. The order (factor A) in which subjects fire at the 
targets is balanced by means of a Latin square. In this kind of experi- 
ment, primary interest is generally in the main effects of factor D and the 
interactions with factor D, particularly the BD and CD interactions. 
The BD interaction is a within-subject effect; the CD interaction, a 
between-subject effect. If the experimenter has the choice of what vari- 
ables are assigned as factors B and C, factor B should be the one on which 
the more precise information is desired. The experimenter may not 
always have his choice in such matters—the dictates of experimental 
feasibility frequently force a decision of this kind in a given direction. 

Computational Procedures. Computational procedures for this plan are 
summarized in Table 10.7-17. Since all treatment combinations in the 
factorial experiment involving factors C and D appear in this plan, a CD 
summary table may be Prepared. From the latter summary table one may 
compute sums of squares of the main effects of C and D as well as the 
sum of squares for their interaction. One may also prepare AD and BD 
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summary tables, since all the treatment combinations in corresponding 
two-factor factorial experiments occur for these two sets of variables. 

For the basic observations one may obtain each of the P,, where the latter 
symbol denotes the sum of the p observations on an individual subject. 
The expression (cell total) used in computational symbol (11) denotes the 
sum of the п observations made under a specified treatment combination. 
Another symbol for this total is ABCD; xm. 


Table 10.7-17 Computational Procedures for Plan 10 


(1) = G*[np'q (7) = B(AD im)" np 
(2) = Хх? (8) = [2(8Р,„)/ир 
@ (3) = (2Aj)/npq (9) = ГЕ(СО,,/Ч/пр 
(4) = (EBj)Inpq (10) = (2Рд)/р 
(5) = (“СӘ/һр4 (11) = [Xcell total)*]In 
(6) = (20% )/пр? 
Between subjects (10) — (1) 
Є (5) - (1) 
D (6) - (1) 
ср (9) - (5) - (6) + (1) 
Subjects within groups (10) — (9) 
i) Within subjects (2) - (10) 
А (3) - (1) 
B (4) — (1) 
AD (7) - (3) - © + (1) 
BD (8) — (4) — (6) + (1) 
Residual (11) — (7) — (8) — (9) + 26) 
Error (within) (2) - (10) - (11) + (9) 


——— oc ee ee 

The residual variation is that part of the variation between cells remain- 
ing after (presumably) all the unconfounded sources of variation due to 
treatment effects have been taken into account. In symbols, 

SSres = SShetween cells (SS, t SS, | 55, t SS, t SSca | 55.4 | 55,). 
The homogeneity of the residual variation may be checked by computing 
separate residual terms for each Latin square. Since each of the Latin 
squares represents a different level of factor D, the latter procedure is 
equivalent to computing a separate residual term for each level of factor D. 

SSres tor level dy, = (11d,,) (Idm) (8d,,) (9d,,) | 2(6d,,), 
where the symbol (114) is (11) with the summation restricted to level 4. 

The variation due to error (within) is that part of the within-cell variation 
which remains after variation due to differences between subjects within 
cells is removed. In symbols, 


SSerror(within) — SSw. сеп — SSsubj w. groups- 
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The latter term may be checked for homogeneity by computing a separate 
SSerrortwithin) for each group. Computationally, 


SSerror(within) forG, = (2g,) = (10g,) = (11g,) T (9g,). 


Plan 11. In Plan 7 only a fraction of the treatment combinations in a 
P X p X p factorial experiment actually appears in the experiment. How- 
ever, in Plan 11 all the treatment combinations in this factorial experiment 
аге used. The primary purpose of this plan is to obtain complete within- 
subject information on all main effects, as well as partial within-subject 
information on all interaction effects in ap X p X p factorial experiment. 
Yet only p observations on each subject are required. 


Table 10.7-18 Schematic Representation of Plan 11 


ai 4а а, а 4а ds а а ау 

бі| bey, bes, Без Gy) bese bea beig G, | be bey, beg 

() Gy] bess bca bey G5} bees Әс bea, Са | beis bego bco 
Gz | beg, beig Әс Go| bey Бсзз bcag Go| bes, bcos bes 
б, | (112) (231) (323) С, | (132) (221) (313) с, | (122) (211) (333) 
Gi) С, (133) (222) (311) G; | (123) (212) (331) Gg | (113) (232) (321) 
бу | (121) (213) (332) Ge | (111) (233) (322) G| (131) (223) (312) 


The construction of this plan will be illustrated by means of a3 x 3 x 3 


factorial experiment. The Starting point for this plan is a set of balanced 
3 x 3 Latin squares, 


I II ш 
1 S 2 3 2 1 2 1 3 
3 2 1 2 1 3 1 3 2 
2 1 3 1 3 2 3 2 1 


It is noted that square II is obtained from square I by means of a one-step 
cyclic permutation of the rows. Similarly, square III is obtained from 
square IT by a one-step cyclic permutation. One now constructs a square 
orthogonal to square I. The following square has this property: 


I 
2 1 3 
3 2 1 
1 3 2 


in parentheses represent t 
binations. 
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The п subjects in each group are observed under the treatment combina- 
tions in each of the rows. These sets are balanced with respect to main 
effects but only partially balanced with respect to interaction effects, 
For purposes of illustrating the manner in which the AB interaction is 
partially confounded with differences between groups, consider the sets of 
treatment combinations assigned to the following groups: 


G, (112 (231) (323) 
б (111) (233) (322) 
G, (113) (232) (321) 


Within each of these groups there аге repeated measures оп the same set of 
combinations of factor A and B, namely, abi, арз, and abs». There is 
balance with respect to main effects as well as the AC, BC, and ABC inter- 
actions when the sum of all observations in the three groups is obtained. 
If all factors are fixed, it may be shown by direct substitution in the basic 
linear model that 


E(Gi+6+8) = Әли + 3n(afy + аб + оба) + 3п(д; + Òs + 6). 
The symbol бү, denotes the sum of the 9n observations in these three 
groups. The effects ду, ôs, and д, designate group effects. Since б,, Ga, 
and С, are each observed under the set of treatment combinations арз, 
аб, and аб), and since there is balance with respect to the main effects as 
well as the AC and BC interactions, 


E(Go4449) = 9n + Зп(орВ,з + ар» + Apa1) + 3т(0; + 04 + ду). 
It may also be shown that 
E(Gy4547) = Inu + Зп(аВ, + «бә + oss) + 3п(д + 6 + ду). 


Thus differences between the three totals С, сув, G2+4+9 and G5, 5,; are 
in part due to the АВ interaction and in part due to group effects. Hence 
two of the four degrees of freedom of the AB interaction are partially 
confounded with differences between groups. j 

In Plan 11, as constructed in Table 10.7-18, two-factor interactions are 
partially confounded with the sets of group totals given below: 


Interaction Sets of group totals 
AB бінен» Gotio Оз+ь+т 
AC [C 
BC бір» бзб?» Оз+а+в 


In each case two of the four degrees of freedom for the respective two-factor 
interactions are confounded with differences between groups. The 
remaining degrees of freedom for the variation due to differences between 
groups are confounded with the three-factor interaction. The totals in- 
volved in the latter аге G,,2+5, біне and Суз. An outline of the 
analysis of variance appears in Table 10.7-19. 
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Table 10.7-19 Analysis of Plan 11 


Source of variation df df for general case E(MS) 
Between subjects 9п —1 пр? —1 
Groups 8 p —1 
(4B) 2 pt 
(Асу 2 р-1 
(BC) 2 D 
(ABCy 2 (p — 1p —2) 
Subjects within groups 9n — 9 p(n — 1) 
Within subjects 18n np(p — 1) 
A 2 р-і оў + npo? 
B 2 Р—1 | 02 + пр? 
С 2 р-і о? + np*o? 
(AB) 2 (p — 1p —2) ор + прозр 
(4C) 2 (р — 1p —2) оў + npo, 
(BOy 2 (p — Xp —2) вр + проф, 
(ABC) 6 (р — 18 =(p – р = 2) |o? + nob, 
Error (within) 18” — 18 P — 1n — 1) 9; 


This plan may be improved by constructing a replication in which dif- 
ferent components of the interactions are confounded with group effects. 


I II ш 
1 2 3 2 3 1 3j 1 2 
2 3 1 3 1 2 1 2 3 
3 1 2 1 2 3 2 3 1 


The following Square is orthogonal to square I: 
r 


1 2 E 
3 1 2 
2 3 1 


Table 10.7-20 Replication of Plan 11 


бі (11) (222) (333) G, (121) (232) (313) G, (131) (212) (323) 
б, (123) (231) (312) б; (133) (211) (322) G, (113) (221) (332) 
Gs (132) (213) (321) €, (112) (223) (331) G, (122) (233) (311) 


A plan constructed from this set of squares is given in Table 10.7-20. 
For this replication, it may be shown that 


Е(Су+в+в) = 9п + 3n(xf, + «б + s) + 3п(9, + % + ôs). 
The set of zf/'s included in this expected value is different from that included 
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in the original plan. For this replication it may be shown that variation 
among the totals 

Grits, Съ+а+о, апі G3+5+7 
is confounded with two of the four degrees of freedom of the AB interaction; 
however, the two degrees of freedom that are confounded here are not 
identical with those confounded in the original plan. 

In terms of the notation of Chap. 8 on components of interaction, in 
the original plan the AB? components of the A x B interaction are con- 
founded with group effects; in the replication the AB components are 
confounded with group effects. Hence the original plan provides within- 
subject information on the AB? components, and the replication provides 


Table 10.7-21 Definition of Computational Symbols 


(1) = G?[np? (6) = ГЕ(АВ,)Ч|пр 
(2) = Ух? (7) -ГЕ(АС,УЧ/пр 

@ (3) = (543)/пр? (8) = ГЕ(ВС,УЧ/пр 
(4) = (5В3)/пр? (9) = [S(ABC;;,)"I/n 
(5) = (2CR)/np? (10) = (P2)/p 


LS p cA аагарға BE 5-4% мез à Fn - 
(1) = GG 3I 

(12) = (EG3)Inp? = (біра + быт + ИРА 
(13) = (202,)/пр? = (бї ass + Ghia + ӘЛГІ 
(14) = (2G},)/np® = (бі + Сват + Озыңа)/лр* 
Мыр с OE ток артынан 


(ii) 


within-subject information on the AB components. Similarly, in the 
original plan the AC components of A x C are confounded with group 
effects; in the replication the AC* components are confounded. A com- 
parable condition holds for the B x C interaction. Use of the replica- 
tion will provide some within-subject information on all components of 
the two-factor interactions. However, additional replications are re- 
quired to obtain within-subject information on all components of the 
three-factor interaction. The original plan provides within-subject infor- 
mation on the АВ?С? component; the replication provides within-subject 
information on the ABC component. No within-subject information 
is available on the ABC? or the AB*C* components. | 

Computational Procedures for Plan 11. Computational procedures here 
are similar to those used inap хр X p factorial experiment in which there 
are n observations per cell. The first steps actually duplicate the latter 
procedures. The nonreplicated version of Plan 11 will be considered first; 
the replicated plan will be considered later. ! ; 

Since all the treatment combinations inap x p х p factorial experiment 
appear in Plan 11, an ABC summary table may be prepared from the basic 
observations. From the latter summary table one obtains 4B, AC, and 
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BCsummary tables. Most of the computational symbols in part i of Table 
10.7-21 are obtained from these summary tables. Symbol (10) involves the 
totals P, ; the latter are the sums of the set of observations on an individual 
subject. The computational symbols in part ii require special comment. 


Table 10.7-22 Computational Formulas 


Between subjects (10) = 1) 
Between groups (11) — (1) 
AB’ (12) — (1) 
AC’ (13) — (1) 
BC’ (14) — (1) ( 
АВС' (11) — (12) — (13) — (14) + 2(1) 
Subjects within groups (10) — (11) 
Within subjects Q) — (10) |] 
A (3) — (1) 
B (4) — (1) 
С. (5) — (1) 
AB" i (6) — (3) — (4) + (] — (02) — D] 
АС" (7) - (3) - (5) + (11-103) — (] 
ВС” [(8) = (4) — (5) + (1)] — [(14) -(ІЛ 
АВС” (9) - (6) — (7) — (8) + (3) + (4) + (5) 
= (1)] — [(11) — (12) — (13) — (14) + X] 
Error (within) IQ) — (9)] — [(10) — (11)] 

The symbol G, denotes the sum of the np observations in groupm. The 
symbol G,, is the sum of those G,,'s which are assigned to the same set of 
аб. For assignments made in accordance with the principles given in the | 
last section, ina 3 x 3 X 3 experiment the Gas are 


Gigs, буаз» апа Gsis+r 


where бі. = Gi + G, + Gs. Іп general there will be np? observations 
in each of such totals. The С.г are made up of the following totals: 


Gitaro Gz+5+8 and Сз+в+ә- 

Each of the latter G,,’s which are combined into a single total is assigned 
to the same set of ac. For example, inspection of Table 10.7-18 indicates 
that groups 1, 4, and 7are assigned to the sets ас, Co, and асзз. The level 
of factor B changes for the different groups, but the set of ac,, remains the 
same. 

Computational formulas for the sum of squares are summarized in Table 
10.7-22. The parts of SScrrorwithiny may be checked for homogeneity by | 
computing a separate sum of Squares for each of the groups. Thus, | 


SSerror(within) for Gm = Ren) — (92,)] — [(10g,) —( 1g,)]. 


where the symbol (2g,,) represents the symbol (2), in which the summa- 
tion is restricted to Gn 
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If Plan 11 is replicated, it is generally wise to carry out separate compu- 
tations for each replication; those terms which are homogeneous 
may then be combined. Since different components of the two-factor 
interactions are estimated in each of the two replications, the two-factor 
interactions are in a sense nested within the replications. Thus, 


55), = 594) trom rep 1 + 554) trom rep 2- 


Similar relationships hold for other two-factor interactions as well as the 
three-factor interaction. The error (within) term for the combined repli- 
cations is given by 


SSerror(within) = SSerror(within) trom rep 1 + SSerror(within) trom rep 2. 


In a sense, the error (within) effects are also nested within each replication, 

Main effects, however, are not nested within replications. The latter are 
computed by obtaining A x replication, В x replication, and C x replica- 
tion summary tables from the basic data. From such tables main effects 
and corresponding interactions may be computed. The latter interactions 
are pooled with error (within) if the components prove to be homogeneous 
with error (within). 

Plan 12. This plan resembles Plan 5 as well as Plan 8 but yet has 
features that neither of the latter designs has. A schematic representation 
of Plan 12 is given below: 


| сі C2 


| 

w 
ee à bh 5 
| sed 7 


б, а а аз a бі 9 
Gy аз аз а | ds а; a 
G | а, аз а | ау аз а; 


In general, there will be p levels of factor A and p levels of factor B. The 
same p x p Latin square is used at each of the q levels of factor C. Plan12 
may be regarded as a fractional replication ofap хрх 4 factorial experi- 
ment. If the interaction with the group factor is negligible, complete 
within-subject information is available for the main effects of factors A, 
B, and C. Complete within-subject information is also available on the 
AC and BC interactions; partial within-subject information is available 
on the AB and ABC interactions. 

The analysis of variance is outlined in Table 10.7-23. In obtaining the 
expected values of the mean squares it is assumed that factors 4, B, and C 
are fixed ; subjects within the groups define a random variable. The terms 
оё, 0%, and ož, appearing іп the expected values, represent interactions with 
subject effects. Because factors A and Bare dimensions of a Latin square, 
the A x subjects within group interaction cannot be distinguished from the 
B x subjects within group interaction. However, the C x subjects within 
group interaction can be distinguished from the others, since factor C is not 
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Table 10.7-23 Analysis of Plan 12 


Source df E(MS) 

Between subjects тр -1 

Groups p: 

Subj w. groups pn — 1) 
Within subjects np(pg — 1) 

A et P оў + оў + прдоё 

B zi о + ой + проз 

(АВ)! (p — p — 2) оё + OF REI 
Residual (1) p(n — 1)(р — 1) ор + оё 

6. 4-і of + 02 + np*o? 

C x groups (p = DG —1) оў + o? + про?, 
Residual (2) Р — 1)(4 — 1) [E 

4C (p — DG — 1) б + оъ + проз, 

BC (p — DG — 1) оё + of, + про), 

(АВУС Ф-Ір-24-1) оё + о + пау, 
Residual (3) рп — (p — 1g — 1) о? + о%, 


Pooled error = residual (1) + residual (2) + residual (3) 


df pooled error = p(n — (рӯ =) 


Table 10.7-24 First Stage in the Analysis of Plan 12 


Source df 

Between subjects np -1 

Groups р-і 

Subjects w. group pn — 1) 
Within subjects np(pg - 1) 

B “pat 

B x group (pics 

В x subj w. group pn — 1)(р — 1) 

с 4-1) 

C x group (p = Ig — 1) 

C x subj w. group pn — 10g — 1) 

BC (р — 1g – 1) 

BC x group A rm 1) 


BC x subj w. group 


(p 
pn — 1)(р — 1g — 1) 
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one of the dimensions of the Latin square. In general, the three residual 
terms should be pooled into a single error term if there is neither a priori 
nor experimental evidence for heterogeneity of these components. 

The computational procedures for this plan are simplified if the analysis 
of variance is carried out in two stages. In the first stage one of the dimen- 
sions of the Latin square, say, factor A, is disregarded. Then the plan 
reduces to a p x q factorial experiment with repeated measures on both 
factors. The analysis of variance for the first stage is outlined in Table 
10.7-24. The detailed computational procedures given in Sec. 7.3 may be 
adapted for use to obtain the first stage of the analysis. 

In the second stage of the analysis, the presence of factor A as a dimen- 
sion of the Latin square is taken into account. The В х group interac- 
tion is partitioned as follows: 


В х group (p- 
A DISSE 
AB' (p - (р -2) 


The latter interaction term may be obtained by subtraction or by the 
relation 
SS,,; = 55,, — SSgroups; 


Where SS,, is obtained from an 4B summary table for the combined levels of 
factor С. The BC x group interaction is partitioned as follows: 


BC x group (р = 1)? 
АС (р = (qq – 1) 
АВ'С (p — Ip ~ 2(q – 1) 


The latter interaction term may be obtained by subtraction from the 
following relation, 
581. = 55, — SSe х group, 
where SS,» is obtained from an ABC summary table. y 
The residual terms in Table 10.7-23 are equivalent to the following 
interactions: 
88,051) = SS, x subj w. groups 
SSres(2) = SSe x subj w. groups 
SSresc3) = 55% x subj w. group: 
If only the pooled error is obtained, the latter is given by 
SSpootea error — SSw. cell — SSsubj w. group: 
Illustrative Application. An experiment reported by Briggs, Fitts, and 
Bahrick (1957) represents a special case of Plan 12. In this study the 


dimensions of a 4 x 4 Latin square were noise level (factor A) and blocks 
of trials (factor B). Factor C represented three experimental sessions. 
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There were three subjects in each of the groups. The groups were as- 
signed at random to the rows of the Latin square. The groups were 
required to track a visual target under various levels of visual noise. 
There were three experimental sessions; within each session there were 
four blocks of trials—each block was under a noise level determined by 
the letters of a randomly selected Latin square. The same square was 
used іп all sessions. The rows of the Latin square define the sequence in 
which the noise levels were given. Hence the group factor may be 
considered as a sequence factor. 


Table 10.7-25 Analysis of Variance for Briggs et al. Experiment 


Source | df | MS F 
Between subjects | dt 
G Groups (sequence) 3 160 
Subj w. group 8. “1.1369 | 
Within subjects (аз | | 
C Sessions 2 2546 13.35 
сс 6 16 
C x subj w. group 16 191 
A Noise level 3 5289 | 3131 
B Blocks of trials 3 330 1.95 
AC 6 293 | 1.73 
BC ce 109 
(АВ) Square uniqueness 6 148 
(AB) x C(C x square uniq.) 12 95 
Residual 72 169 | 


The analysis of variance reported by the authors differs slightly from that 
given in Table 10.7-23. Their analysis is outlined in Table 10.7-25. The 
residual term in this analysis corresponds to residual (1) + residual (3). 
The C x subjects within group term corresponds to residual (2). Мо 
attempt was made to interpret the partial information on the AB and ABC 
interactions. Instead, these sources of variation were considered to be 
functions of the particular Latin square selected for use. The latter inter- 
pretation is the preferred one if the sequence (group) factor can be con- 
sidered as contributing toward these interactions. If there is no reason 
for considering residual (2) as potentially different from residuals (1) 
and (3), the experimental data do not rule against pooling the equivalent 
of residuals (1), (2), and (3) into a single pooled error term to be used in 
testing all within-subject effects. 

Plan 13. This plan resembles Plan 9; in this case, however, a Greco- 
Latin square replaces the Latin Square. A schematic representation of 
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Plan 13 is given below: 


| сі с сз e сь сз 

| G, | aby abs, абу G, | аба abs, abs 
а, | б, | abs, аһы аба d, |. Gg |а Gb ab 
| Gy | аб аба аба | G | aby абз аба 


The same Greco-Latin square is used for each level of factor D. 

The analysis of variance for this plan is most easily understood if it is 
made in two stages. In the first stage, two of the four dimensions of the 
Greco-Latin square are disregarded. Suppose that factors 4 and B are 


Table 10.7-26 Analysis of Variance for Plan 13 


Source df 
Between subjects: пру —1 
Rows pl 
D 4-1 
D x row (р = 104—1) 
Subjects within groups pain — 1) 
(i) Within subjects прф(р = 1) 
С p-l 
C x row -1» 
CD (р -Dq - 1) 
CD x row (p = Ig -1) 
C x subj w. group (Error) pin = 0р = 1) 
С x row (pes 1)? 
A pet 
B p-! 
(ii) (ABY (р = 1Xp = 3) 
GD-X TOY (M D 
AD (p -Wq = ) 
BD (p = Iq = 1) 
(AB) р (p = Wp -34 - 1) 
DAES ee OCE 


disregarded. The resulting plan may be considered as ag X p X p fac- 
torial experiment with repeated measures on one of the factors. The 
analysis of variance for the first stage appears in the upper part of Table 
10.7-26. 

In the second stage, the interactio 
square (C and rows) are partition 


ns which involve two dimensions of the 
ed into main effects and interactions 
associated with factors A and B. This stage of the analysis is shown in the 
lower part of Table 10.7-26. It should be noted that (4 B) cannot be dis- 
tinguished from (4C)' or (BC). This latter source of variation is some- 
times called the uniqueness of the Greco-Latin square. In the partition of 
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the CD x row interaction, (4B)'D cannot be distinguished from (4C)'D 
or (BC) D. 

If all factors are fixed, all within-subject effects are tested by means 
of F ratios having C x subjects within group as a denominator. If 
interactions between the dimensions of the square are negligible, (4B)' 
and (АВ) D may be pooled with the experimental error. Тһе F ratios for 
between-subject effects have subjects within groups as a denominator, 


Table 10.7-27 Analysis of Variance for Briggs Data 


Source | df MS F 
Between subjects | 79 
R Rows (sequences) 3 | 1,088 2.29 
D Squares (initial learning) 4 8,975 18.85 
DR 12 427 
Subjects w. groups 60 476 
Within subjects 240 | 
A Lists 3 | 1,66% 10.80 
B Overlearning 3 22,389 144.94 
C Session 3 755 4.88 
AD 12 122 
BD IF TON 346 2.24 
CD 12 127 
(Ав) 3 152 
(4вур 12 114 
Еггог 180 154 


Illustrative Example. Briggs (1957) reports an experiment which is a 
special case of Plan 13. A 4 x 4 Greco-Latin square formed the basis for 
the plan. Groups of four subjects each were assigned to the rows of à 
Greco-Latin square. The letters of the Square represented the lists (factor 
A) of syllables to be learned and the degree of overlearning (factor B). The 
columns of the square (factor C) represented the experimental session. 
There were five squares (factor D) in all—subjects in different squares 
received different amounts of initial learning. The same Greco-Latin 
Square was used throughout; hence corresponding rows of the squares 
Tepresent the same sequence of combinations of factors A and B. 


A slightly modified analysis of variance for the Briggs data is given in 
Table 10.7-27. 
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Summary of Plans in Sec. 10.7 


Plan 5 Plan 6 Plan 7 
ЕЕ а, а а 4% а а 44 |6 
Gı | by by by G,| & | 5 bs 5 G, | beg bca bca 
Gy | b bao Bs Ga | e | Bs, by By б, | bc bes, bes 
Gs | bg by b, Gy. |с | by Бы Сз | beg боз beo 
Plan 8 
ба а ds | а а а 
СЕ is ен Gil bebe Dy 
n Ga | ba by b, ca | G5 | by by by 
s | bs bi b, e| bs bi ba 
Plan 9 
| a а 4 а, 4 % а а ds 
Gy | by by by Ga | by by b, Gr | by by b, 
cı | Gy | by ba bs € | Gs| by by by в | Gg | bi by bs 
Gs | by by b, Ge | bs bi by o| bs b ba 
Plan 10 
а а; ds а 4% а 
| Gi | c | bi ba G,| cı | ba bi by 
dı | Ga | Ca | ba by by ds | Gs со | by bs bs 
Gz | cĉ | bi by bs Ge | cs | bs by bi 
Plan 11 
a а; аз | а а; аз a а; аз 
Gi| bey, beg, bcs Ga | beg beg Без б, | beg, bey безз 
Ga |. begs be Әс G; | bcos bey, beg Gg | beig бс» beg 
Су | bca beig bcag Ge | be bess eae Go | beg, bcos beig 
Plan 12 
с с 
by by by | by by bs 
Gij а а % | а а 4% 
Go| ау а @ | a3 а, а, 
Сз a а a| a а; Ф% 
Ріап 13 
сі Ca с e Ce E 
G, | аба abs, аба G, | аба abs, абы 
di| G} | аба аба абу d, | G; | аз ањ abı 


б; | aba аба аба Ge | абд аба abs, 


CHAPTER 11 


Analysis of Covariance 


11.1 General Purpose 


There are two general methods for controlling variability due to experi- 
mental error—direct and statistical. Direct control includes such methods 
as grouping the experimental units into homogeneous strata or blocks, 
increasing the uniformity of the conditions under which the experiment is 
run, and increasing the accuracy of the measurements. Replicated experi- 
ments, randomized block designs, repeated-measure designs, split-plot 
designs, incomplete-block designs—these designs use the direct-control 
principle to increase the precision of the experiment. 

In this chapter, designs which use an indirect, or statistical, control (1) to 
increase the precision of the experiment and (2) to remove potential sources 
of bias in the experiment will be discussed. The latter objective is one that 
is particularly important in situations where the experimenter cannot assign 
the individual units at random to the experimental conditions. Statistical 
control is achieved by measuring one or more concomitant variates in addi- 
tion to the variate of primary interest. The latter variate will be termed the 
criterion, or simply the variate: the concomitant variates will be called 
covariates. Measurements on the Covariates are made for the purpose of 
adjusting the measurements on the variate. ‹ 

For example, suppose that the purpose of an experiment is to determine 
the effect of various methods of extinction upon some kind of learned re- 
Sponse. The variate in this experiment may be a measure of extinction; 
the covariate may be a measure associated with the degree of learning at the 
Start of the extinction trials. As another example, suppose that the purpose 
of an experiment is to measure the effect of various stress situations upon 
blood pressure. In this case а measure of blood pressure under a condition 
of no stress may be the covariate. As still another example, suppose that 
the purpose of an experiment is to evaluate the effect of electrical stimulation 
on the weight of denervated muscle. The weight of the corresponding 
normal muscle may serve as a covariate in this type of experiment. 
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Direct control and statistical control may, of course, be used simul- 
taneously within the same experiment. One ог more variates may be under 
direct control; one or more variates may be under statistical control. 

Suppose that the criterion means in a single-factor experiment are 
designated 

AD aise es 


and that the covariate means are designated 
Xo ЖИ КЕ 


Primary interest lies in differences among the Y,’s. Suppose that differ- 
ences in the ¥,’s are due to sources of variation related to the Y,’s but 
not directly related to the treatment effects. If this is the case, then more 
precise information on the treatment effects may be obtained by adjusting 
the Y,’s for the effect of the X/s. Suppose that the adjusted means on 
the criterion are designated 
О A ioral s 

There are many ways of adjusting the criterion for the influence of the 
covariate. In some cases the adjustment may take the form of a simple 
difference between the corresponding means; in other cases the adjusted 
value may be the ratio of corresponding means. The appropriate adjust- 
ment for the influence of the covariate may, in some cases, be determined 
by experience gained from prior experimentation. More generally, the 
average effect of an increase of 1 unit in the covariate upon the variate is 
given by some form of regression analysis. Although the form of the 
regression need not be linear, only the linear case will be considered in 
this chapter. In terms of a linear regression, an adjusted criterion mean 
has the following form, 5 

Y; = Y,—(X,— X), 

where b is an estimate of f, the population linear-regression coefficient. 
The estimate of f is obtained from the data of the experiment. . 

In terms of the basic linear model underlying the analysis of variance, 


Y, =u +B- X) * 7; & 

Yj =u+7, + 8; 
Thus differences between the adjusted criterion means are free of the linear 
effect of the covariate. If the effect due to the covariate is essentially 
linear, then this kind of adjustment process is adequate; otherwise the 
adequacy of the adjustment is questionable. Тһе ғ, term in the linear 
model includes the nonlinear variation in the covariate as well as all 
other sources of variation which are not under direct or statistical control. 
If the covariate measures the effects of some environmental source of 
variation that tends to inflate the experimental error, removal of the 
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influence of the covariate is equivalent to statistical control on the experi- 
mental error. 

The change in experimental error due to this adjustment process depends 
upon the linear correlation between the variate and the covariate. If this 
correlation is equal to p, and if the experimental error per unit, disregarding 
the covariate, is 05, then the experimental error after the adjustment is 


01 — 2! +] 


4 2. 


where f, represents the degrees of freedom for the estimation of o». Tf, 
instead of using a covariance adjustment, the experimental units could be 
grouped into blocks (equivalently, into strata or classes) which are homo- 
geneous with respect to the covariate, and if the relation between X and Y is 
linear, then the effect of this kind of blocking is to reduce the experimental 
error from 


4 to oX1 — p 


Thus, when regression is linear, covariance adjustment is approximately 
as effective as stratification with respect to the covariate. However, if the 
regression is not linear and a linear adjustment is used, then stratification 
will generally provide greater reduction in the experimental error. Ina 
real sense, stratification is a function-free regression scheme. 

When the covariate is actually affected by the treatment, the adjustment 
process removes more than an error component from the criterion; it also 
removes part of the treatment effect, An analysis of variance on the co- 
variate will sometimes throw some light on whether or not the treatments 
are affecting the covariate, Apart from its function in reducing experi- 
mental error, the covariance adjustment may also be used to remove bias 
due to the covariate in studies where the experimenter must work with 
Intact groups. 

To illustrate this latter point, Suppose that the criterion means for treat- 
ments / and m may be expressed in terms of the following linear model: 


Y, = My +7; + Pits, — n.) + е, 


Ym = Hy + Tm + Ви. — и.) + Em: 
Then the difference between the criterion means has the expected value 


EY, — Y) = (r, —7,) + Bus, — Han). 


The expected value of this difference is not a function of treatment effects 
alone—there is a bias which is a function of the difference between the 
covariate means and the magnitude of the regression coefficient. If the 
experimental groups are matched with respect to the covariate, or if f = 0, 
the bias disappears. However, if there are differences in the covariate 


| 
| 
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which cannot be subject to direct control and if 6 0, then the bias may be 
removed by a covariance adjustment. Direct control over this kind of bias 
is generally not possible when the experimenter is forced to work with intact 
groups. In such situations, statistical control of relevant sources of varia- 
tion is the only method available to the experimenter. 

In cases where differences between the covariate means Y; and Y; are 
relatively large, adjustment for this kind of bias is generally not based upon 
statistically sound principles. Such adjustments often require extrapola- 
tions which are either beyond the range of the data or in regions of the 
range where there are few observations. At best covariance adjustments 
for initial biases on the covariate are poor substitutes for direct controls. 
A more complete discussion of this latter point is given by Cochran 
(1957). An excellent summary of the wide variety of uses for the analysis 
of covariance is contained in a special issue of Biometrics (1957, 13, no. 3), 
which is devoted entirely to this topic. 


11.2 Single-factor Experiments 


The use of covariates to increase the precision of an experiment or to 
remove sources of bias is possible for all the designs that have been con- 
sidered in earlier chapters. The principles underlying adjustment for 
covariates will be discussed in some detail for the case of a single-factor 
experiment. The generalization to other designs is relatively direct. The 
analysis of variance for designs which include covariates is frequently 
referred to as the analysis of covariance. 

Consider a single-factor experiment in which there are k treatments. The 
notation that will be used for this case is summarized in Table 11.2-1. It 
will be assumed that there are n experimental units assigned to each of the 
treatment conditions. Two measurements are made on each of the experi- 
mental units—a measurement on the criterion, Y;;, and a measurement on 
the covariate, Y; (Only the case of a single covariate will be considered in 
this section; the generalization to multiple covariates is direct.) (М 

In the notation given in this table, Т represents л times the variation 
among the covariate means, E,» represents the pooled within-class variation 
on the covariate, and S,,, represents the total variation for the covariate. 
T,,, E,,, and S, are, respectively, the analogous sources of variation for the 
criterion. The symbol 7,, represents л times the covariation between the 
covariate and criterion means, E, represents the pooled within-class co- 
variation between the covariate and the criterion, and Spy represents the 
total covariation between the covariate and the criterion. 

Based upon the data for the n observations within treatment class j, the 
best estimate of the linear regression coefficient for the prediction of the 


criterion from the covariate is 


= My 


а 


"O0 


#Худх = “a + "= 


AX — "Х)дх = "I + “L = “g 


QDUBLIEAOD Jo SISÂ[EUV Ə} 10у UOYLJON 1-СТІ AQEL 


Ad — "ag = “а + = ту 
ne = M бел бы) ләт ел = c8. 
1 ! 
а - ADR = ея (а — Xy — "ху = “а fxd Beit ay 
Ad — '4)хи= “р (4 — 4x – хи = = AX — xxu = “L 
ЕГЕР 2-7 
2p = "pe My = "A жер ay x fey fay Sale шү "y ung 
uo: aug 594 sc | шұ Li g 
жү wy ay D^ Wy TA 
My arr "y пт тү ur 
A учәшуеәлү, xa f'juounvo1p e реше 
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b, is called a within-class regression coefficient. The corresponding within- 
class product-moment correlation is 


r = biy E) = Еш, 
xy. Uu ТА 
" : Бу; VER Eun 
If it can be assumed that 

By Bj: By 8. 


i.e., that the regression coefficients within each of the treatment classes are 
homogeneous, then within-class information from all the treatment classes 
may be pooled to provide a single estimate of the parameter f. This 
pooled estimate of В is 


Homogeneity of within-class regression is one of the fundamental assump- 
tions underlying the analysis of covariance. 
Adjusted criterion means have the general form 
Y; = Y; —b(¥,— X), 

where b is the pooled within-class regression coefficient. The variation of 
Ў; is not, however, used in making an over-all test of the hypothesis 
that о? = 0. Because of sampling error in the estimation of f, and 
because the values of Ў; are not statistically independent (adjust- 
ments are correlated), a valid F test on the over-all hypothesis that o? = 0 
requires a somewhat roundabout method for computing the numerator 


of the F ratio. А ] [ 
Suppose that the treatment classes are disregarded and a single linear 


regression equation is computed for the entire set of kn pairs of observations. 
Assume that this regression equation is 
== (Жу X)4 Y. 
The residual sum of squares about this regression line may be shown to be 
ZX(Y,— YG)? = Sy — (5/8): 

If there аге n observations in each of the К treatment classes, there are kn 
statistically independent experimental units. The degrees of freedom for 
the variation of the residuals are kn — 2. ks 

The regression equation based upon pooled within 
has the form 


-class information 


ү; = Ху = X) + Y» 
The sum of squares of residuals about this regression line may be shown 


to be 
Ез; 
ХХҮ;; = YA): = Ey — E. i 
тт 


This source of variation has k(n — 1) - 1 degrees of freedom. 
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The reduced sum of squares due to treatments is, by definition, the dif- 
ference between these two residuals, i.e., 


Si Её 
Tun (5,58) - (e, 58). 


au 


Since $,, — E,, = T,,, the reduced sum of Squares for treatments becomes 
Sa heey 


cec c d 
wR уу а НІ? po 
To summarize, the residual variation about the over-all regression line has 
been divided into two parts. One part is a function of within-class residual 
variation; the second part, the reduced sum of squares due to treatments, is 


a function of between-class variation. 


Table 11.2-2 Analysis of Variance and Analysis of Covariance 


Source SS df MS 
Total | Буу kn —1 
4) Eror T k(n — 1) MSerror 
Treatments Ту, = Syy — Ey k-1 MStreat 
е == ld dave. 
Total Soy = Sy — (Sh, Su) Loco MN 
(ii) Error Е,-Е,-(ЕМЕШ | Кп — 1) —1 MSerror 
Treat 
(reduced) Tur = Spy — Ej, к-і MStreat 
MStreat MStreat 
a" F = M E feeds 
ii) MSerror МУ гог 


Kk —1,kn 0] 0), k(n — 1) — 1] 


This partition is given in part ii of Table 11.2-2. By way of contrast, 
the analysis of variance is shown in parti. Itis seen that the analy- 
sis of variance partitions the total criterion variation into between- and 
within-class sources. In contrast, the analysis of covariance partitions the 


residuals from an over-all linear regression into between- and within-class 
sources, 


The linear model in this analysis has the form 
Yj = Yy — ВОХ, — X) tery Fey: 
A test on the hypothesis that от = 0 makes use of the statistic 
F — MStreat к 
MSerror 


When the hypothesis being tested is true, numerator and denominator may 
be shown to be independent estimates of a common residual variance. 
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This F ratio has the general form of an F test in regression analysis. 
This form is 

SDE est) 
Di[k(n — 1) — 1]" 
where D" is the variation of the residuals about the over-all regression line 
and D is the variation of the residuals about the pooled within-class line. 
It is of interest to indicate the relationship between variation due to 
adjusted treatment means, Y; = Ў, — b(X; — X), and the reduced varia- 
tion due to treatments. (The variation due to the Y;’s will be denoted by 
Тл.) Toward this end, a third regression equation needs to be intro- 


duced—the regression equation determined from the k pairs of means. 
This latter regression is called the between-class regression. It has the form 


У = b"(X; B X) 25 Y, 
where b”, the between-class regression coefficient, is 
Ti 


p" = im. 
Т, 


This regression equation provides the best linear fit for predicting Ӯ, from 
X, The variation of the residuals of the Y, about this regression line is 
RP Ta 
nxY, F em 2: 
y 
Cochran (1957) has shown that 
а) Toya = Ty — 26T xy + Т,, 
= Ty bb" Tas F ЁТ. 
From the relation (1) it may be shown that Tyr and 7,,, are related as 
follows: 
TEDL T by 
Пааа 
The relationship іп (2) shows that Tyr will always be less than Тул. Тһе 
latter is often used as an approximation for Т,„п- The relationship given 
in (1) provides a simple method for computing 7,, ,. If the treatments do 


not affect the covariate, Ту will generally be close to Tyr 
A test on the difference between two adjusted treatment means has the 


(2) Tyr = Twa — 


form Ы (f; Т; 
25 PX 
ws]? 4 C Ч | 
n РА 


The degrees of freedom for this F ratio are 1 and k(n — у— 1, the latter 
being the degrees of freedom for MS;. Note that the variance of the dif- 
ference between two adjusted means depends upon the magnitude of 
(X, — ¥,,)2. The average variance of the difference between two adjusted 
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means, averaged over all values of (X, — Х„)°, is 


2MS; | ү Telk = D, 
n E, 


This average variance of a difference between adjusted means may be 
used in making tests between several pairs or all possible pairs. 

Ratios of various sums of squares are readily translated into correlation 
coefficients. The square of the within-class correlation is 


Е.Е, 

уу 
This correlation coefficient measures (һе goodness of fit of the within-class 
regression lines, using the regression coefficient b, to the individual observa- 
tions, The square of the between-class correlation is 


2 
2 Tul Tox 
Tl'between class = Т. . 


vy 


2 
Tw.class — 


This correlation measures the goodness of fit of the between-class regres- 
sion line to the criterion means. 

Tests on the Assumptions Underlying the Analysis of Covariance. All 
the assumptions underlying the usual analysis-of-variance approach are 
also required in the analysis of covariance. In addition, there are as- 
sumptions about the regression effects. First, it is assumed that treat- 
ment effects and regression effects are additive. Implicit in this is the 
assumption that regressions are homogeneous. Second, it is assumed 
that the residuals are normally and independently distributed with zero 
means and the same variance. Implicit in this is the assumption that the 
proper form of regression equation has been fitted. Ifa linear regression 
18 used when the true regression is curvilinear, then the assumptions 
made with respect to the residuals will generally not hold. 

Evidence from the usual analysis of variance indicates that F tests in the 
analysis of covariance are robust with respect to the violation of the two 
assumptions, normality and homogeneity of the residual variance. The 
effect of nonhomogeneity of within-class regression, which is analogous to 
lack of additivity, has not been studied. There is an internal check that can 
be made on the assumption that the within-class regressions are homo- 


geneous. Toward this end, the error (within-class) variation may be sub- 
divided into the following parts: 


ТЕЬ о пазарни 


Source SS * df 
Error Ey, — (Е? ЈЕ.) k(n = 1) –1 
Sy Ey, — У(Е? Еш) k(n — 2) 


% | XGRJE.) — (ER JE.) k-1 
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S, may be shown to be the variation of residuals of the form 

X Y; шу 5, 

i 
where Yu = 26,6 om Xj d Y,. 
This last regression equation is determined entirely from the data within 
treatment class j. (Each treatment class has its own regression line.) 
Within each class the variation due to the residuals has n — 2 degrees 


of freedom; the residuals from the К classes will have k(n — 2) degrees of 
freedom. 

S, may be shown to be a function of X(b; — 0), the variation of the 
within-class regression coefficients about the pooled within-class regression 
coefficient. The larger this source of variation, the less reasonable it is to 
assume that the corresponding population regressions are equal. А test of 
this latter hypothesis, i.e., the hypothesis that f, =‘ = f; = ``‘ = Вь, 
has the form 


Lose cay 
Б Қа = 2) 


When the hypothesis being tested is true, this F ratio has a sampling 
distribution which can be approximated by an F distribution having 
k — 1 and k(n — 2) degrees of freedom. 

—— noL RS ан ем тақа т = 


Source 55 df 
Treatments ; 
(reduced) Т,,-(8%/54) + (Ezy! Eza) K= 
53 Т, = Qu Tos) к-2 
S, (Т?Т) + (ЕЕ) — (82/8) 1 


The reduced sum of squares may also be partitioned into two parts. 
S, is a measure of the variation of the treatment means, Y, about the 
between-class regression line. The larger this source of variation, the larger 
the deviation from linearity of the between-class regression. Sy 15 a 
function of (b” — Б), the squared deviation of the between-class regres- 
sion coefficient and the pooled within-class regression coefficient. 

If the within-class regression is linear, and if the covariate is not affected 
by the treatments, it is reasonable to expect that the between-class regres- 
sion will also be linear. A test of the hypothesis that the between-class 
regression is linear has the form 
e 53/06 —2). 

MSéerror 


When the hypothesis being tested is true, this F rati 
bution which is approximated by an F distribut 


о has a sampling distri- 
ion having k — 2 and 
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k(n — 1) — 1 degrees of freedom. IF this Tegression does not prove to be 
linear, interpretation of the adjusted treatment means becomes difficult, 
A test of the hypothesis that Poetween class = By, class has the form 


MSerror 


The degrees of freedom for this statistic are 1 and k(n —1) — 1. Note 
that Bretweon aas 18 estimated by 5”, whereas f, eass is estimated by b. 

In the analysis of covariance, the residual variation about an over-all 
regression line which is fitted to the entire set of data, disregarding the treat- 
ment classes, is what is partitioned. An approximate test on the linearity 
of this over-all regression is given by 


F — (S2 + 5з + S4)/2(k — 1) 
S,/k(n — 2) 
The degrees of freedom for this F ratio are 2(k — 1) and k(n — 2). 


The four different regression lines that have been used along the way are 
summarized in Table 11.2-3, 


Table 11.2-3 Regression Equations in the Analysis of Covariance 


Regression equation Residual variation 
Within class j Yä —b,—X)- Y, | Evy, = Ey, — (Ed, | Eso) 
Pooled within class Y; -bX,— X) Y, | Ey Shy (ЕЗ Е) 
Between class YP = 6X, — X) + ¥ 7%, — Ty, — (T2, T2) 
Over-all Yr = bX, = Xe Siy = 5, — (S2y/Sizx) 


yy E 9у 
RM UI аладан ы аланы UN 
11.3 Numerical Example of Single-factor Experiment 


Computational symbols that will be used in the numerical example are 
summarized іп Table 11.3-1. Тһе symbols in parts i and ii are required only 
if tests on the assumptions underlying the analysis of covariance are made. 
Symbols required for the over-all analysis of covariance are summarized in 
parts iii and iy, 

, The numerical data in Table 11,3-2 will be used to illustrate the computa- 
tional procedures, Suppose that the k = 3 treatments represent different 
methods of training. The experimenter is not at liberty to assign the sub- 
Jects at random to the different methods of training; he is required to use 
groups that are already formed, However, from all available information, 
the groups were not originally selected on the basis of variables that are 
considered directly relevant to the study; for most practical purposes the 
groups can be considered as random samples from a common population. 

There are n = 7 subjects in each ofthegroups. The groups are assigned 
at random to training methods. Before the subjects are trained under the 
method to which they are assigned, they are given a common aptitude test. 
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Table 11.3-1 Computational Formulas for Analysis of Covariance 


0х) = EX}, 
(i) i 
Gx) = Т?п 


Ej, = (2х1) — (3ху) 
Ш) * 


Ej, = (2x1) — (хь) 
(1x) = G2/kn 
(2х) = XQxj) 
(iii) Saxe 
(3x) 


= (ET2)In 


$ = (2х) — (1х) 
бу) Exe = (2х) — (3х) 
Т» = (3x) — (1х) 


Оху) = XXjY; 
xy) = Ta Tyin 
Ey, = (Оху) — 
Ean 


= (ху) = Bxyx) 


(ху) = G,G,]kn 
(ху) = XQxyj) 
-XEXY 
(3ху) = Xxy) 
= (ET, T,)|n 


Szy = Qxy) - (Ixy) 
Ey = Qxy) — Gxy) 
Т.у = (3xy) — (Ixy) 


Оу) = ЖҮР 
i 
(3у) = Tin 


Ey, ind (2yx) = (Зу) 


(19) = G?/kn 

оу) = ХОу) 
= Хү? 

Gy) = У(Зу,) 
= (ЕТ})п 


Sy = (2у) - (ly) 
E,,.7 Qy) — Gy) 
Ty = (Зу) — (ly) 


Жуу = 5, — (Si, Sox) 


(v) ГО, -Е,у- (Е? ГЕ) 
Тур = Sy, ~ Evy 


а ——————_———— 


The scores on this test define the covariate measure X. After the training 

is completed, the subjects are given a common achievement test over the 

material covered in the training. The score on the latter test is the criterion 

measure Y. For example, the first subject in treatment class 1 made a score 

of X = 3 on the aptitude test anda score of У = боп the achievement test. 
In part i of this table, the entries under treatment 1 are as follows: 


XX,-15 Хи = 31 
Е = as! 
УХ;Ү; = 75 

i 


d the sum 


In words, these entries are the sum, the sum of the squares, an ne 


of the products of the pairs of observations under treatment 1. 
totals to the right of part i are as follows: 


xmj-s 60) 
у(х) = 196 PI x) = 881 


х(5ха5)) = 398 


590 ANALYSIS OF COVARIANCE 
Table 11.3-2 Numerical Example 


т а-ы ысы. 


Treat 1 Treat 2 Treat 3 
Poma aoc re LS ME. 
3 6 4 8 3 6 
1 4 5 9 p 7 
3 5 5 7 2 7 
1 3 4 9 3 7 
бі) 2 4 3 8 4 8 
1 3 1 5 1 5 
4 6 2 3j 4 7 Totals 
B) M ТОЗ EAT рут 58 131 
XC y 41 147 96 43 59 3a» 196 881 
ZxyY 75 191 132 398 
= Se с 2 
Total 
IF pra 8.86 13.71 7.43 30.00 = Е,, 
(ii) ЖК Ен 8.57 9.29 4.43 22.29 = E,, 
Tut oe 97 | іл | 543 26.86 = Ep 
(1x) = (58)°/21 = 160.19 (1y) = (131)2/21 = 817.19 
(2x) = 196 (2у) = 881 
(3х) = (15° + 242 + 193/7 (Зу) = (31° + 53° + 472)/7 
(iii) = 166.00 = 854.14 
(1ху) = (58)(131)/21 = 361.81 
(2xy) = 398 
Gxy) = [15)(31) + (24)(53) + (19)(47)]/7 = 375.71 
S, = (2x) — (1х) = 35.81 Sry = Qxy) - (Ixy) = 36.19 
Еш = (2x) — (3х) = 30.00 Ezy = (2xy) - (xy) = 22.29 
Тш = (3x) — (1х) = 5.81 Tey = (3ху) — (Ixy) = 13.90 
(iv) Sy, = (2y) — (1y) = 63.81 


En = Qy) — (Зу) = 26.86 
Туу = (Зу) — (1y) = 36.95 


These entries are the sums of the corresponding entries under the treatment 
headings. 


The entries in part ii are within-class error terms for each of the classes. 
For example, 


Еш, = 41 — (15/7) = 8.86, 
Ex, = 15 — [(15)(31)/7] = 8.57, 
Ey, = 147 — (318/7) = 9.72. 
At the right of partiiare the totals for the entries in the corresponding rows. 


Computational symbols used in the over-all analysis are given in part iii. 
These symbols are defined in part iii of Table 11.3-1. 
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To test the hypothesis of homogeneity of within-class regression, from 
part ii of Table 11.3-2 one computes 


ps 8.57? 9.292 44» 
Dy П 


Жу; 


Е 886 1371 ^ 143 


quj 


= 8.29 + 6.29 + 2.64 = 17.22. 


The variation of the individual observations about the unpooled within- 
class regression lines is 


2 


S, — E, у Ёш 26.86 — 17.22 = 9.64. 


уу 


22; 


The variation of the individual within-class regression coefficients about the 
pooled within-class regression coefficient is 


2 2 
s, =» Ёзи 24 17.22 — 16.56 = .66. 


Es, En 
The statistic used in the test for homogeneity of within-class regression is 
5/06 = 1) _ (66/2 
S,/k(n —2) (9.64)/15 


The critical value for a .10-level test is Ғ..(2,15) = 2.70. The experimental 
data do not contradict the hypothesis of homogeneity of within-class 


Table 11.3-3 Analysis of Variance 


Source SS | ағ MS | F 
Ton] СЕ ИШ ЛЫ 20 
Еггог | Ey =2 I8 | 149 
"x2 18.48 | 12.40 


Treatments ЕРЕ 6.95 | 


regression. The outcome of this test, іп part, does not rule against pool- 
ing the within-class regressions. The appropriateness of the covariance 
model for the data justifies, in the long run, such pooling procedures. 4 
An analysis of variance оп the criterion variable alone is summarized in 
Table 11.3-3. Although the F test here indicates statistically significant 
differences in the treatment means, (2,18) = 6.01, direct interpretation 
of this test is difficult since the treatment classes appear to have different 
Covariate means. A question might be raised about differences NN 
treatments after adjustment is made for the linear trend in the relations ip 
between the criterion and the covariate. Phrased in other words, do dif- 
ferences between the criterion means remain after à statistical adjustment 
has been made for the effects of the covariate? In a sense, the analysis of 
Covariance attempts to approximate the situation in which each of the 
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treatment groups is equated on the covariate; this approximation pro- 
cedure assumes that a linear model is appropriate. 

The analysis of covariance is summarized in Table 11.3-4. The F ratio 
in this table provides a test of the hypothesis that o? = 0, after the criterion 
data have been adjusted for the linear trend on the covariate. The critical 


Table 11.3-4 Analysis of Covariance 


Source 


SS 


Total 5%, = 27.24 19 
Error Fy = 1030 | 17. 61 
Treatments | 7,,4 = 16.94 2 8.47 13.89 


value for a .01-level test in this case is Fo(2,17) = 6.11. Thus the experi- 
mental data indicate statistically significant differences between the criterion 


Scores for the groups even after adjustment is made for the linear effect of 
the covariate, 


The adjusted means are given in Table 11.3-5, In making multiple tests 
between these means, the average effective error per unit is that given in this 


Table 11.3-5 Adjusted Treatment Means 


b = ЕЕ. = 743 


Xj... | 244 | 343 | 271 X = 2.76 
ЖЖ, —62| 67 | —0s 
Р...) 443 1757 | 671 Р-624 
|- 


l | 
48 | 707 | 675 Y'-62 


Й Т,.(К —1 
MSérrortettective) = MS + ы | 


Ӯ = Y, WF, Ж)... | 


m 
‹ 5.812] _ 
- s +5000 = .67 


Ны а poer 04 n] 


table. In making tests which fall in the a priori category, a somewhat 
different error term is used. For example, to test the hypothesis that 
TERT, 
7.07 — 6.75)? 
р 001-635) Сы a ари 
в1[2 i ee sen] 1848 
61] = + 
7 30.00 
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The critical value for a .01-level test in this case is Fy(1,17) = 8.40. Had 
the average effective error been used, this ratio would be 


_ (7.07 — 6.75)? _ 1024 _ 
2(.67)/7 4914 ` 


The critical value used in the previous test would also be appropriate in this 
latter test. 

It is of interest to examine the magnitude of the individual within-class 
correlations. The squares of these correlations are as follows: 


2 
ТЕСІЛЕ Aa ЕЕ, _ 54 = Eel Ern, ag, 
E, Em, 


w ууз 


The square of (ће pooled within-class correlation is 


2 
"UT 


In terms of this latter correlation, the error term in the analysis of covari- 
ance is given by 


1 
MSérror  MSerror(! = ap t k(n = 1) — ; 


= 1.49(.383)(1 + te) = .61. 


This is the value for the error mean square in Table 11.3-4. Thus the reduc- 
tion in the error term (increase in precision) is primarily a function of the 
magnitudeof;?. Thelarger the latter correlation, the greater the reduction 
in the error term. > 

The reduced sum of squares for treatments, Tyrs depends in part upon 
the magnitude of r% and іп part upon the magnitude of the squared 
between-class correlation. The latter is 
n= ТыТы = .90, 
Т, 
When ғ, is large relative to r,, the reduction in the treatment variation can 
be relatively larger than the reduction in the error variation. When this 
occurs, the F ratio in the analysis of covariance will actually be smaller than 

nalysis of variance. The latter finding 


the corresponding F ratio in the ai c j 
would ae kdai that bias due to the covariate had inflated the F 


ratio in the original analysis of variance. А 
When ғ, is negative апа r, is positive, 7, will always be Mim than 
T,. The reduced sum of squares, Туу, may in this case also be larger 
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than T,. When this happens, the F ratio in the original analysis of 
variance is deflated because of the bias due to the covariate. 

If the treatment classes are disregarded, the square of the over-all correla- 
tion between the covariate and the criterion is 


Soul Sus 


Rather than using a covariance analysis for the data in Table 11.3-2, the 
experimenter might have attempted to use the covariate as a classification or 
stratification factor. If this were done, the experiment would be analyzed 
as a two-factor experiment. The data would have the form given in Table 
11.3-6. In this particular case the cell frequencies would be quite small; 


= .57. 


2 
Ttotal = 


Table 11.3-6 Use of Covariate as a Classification Factor 


Covariate юх CHR н | 
classification 1 5 | т 
>) | 97 | 
2 6 Eie О" ТЕТ 
3 es T a | 67 
2 277 
| 43,3 | 5 5 


further, there would be no entries in some of the cells. If each of the 
resulting cell frequencies is relatively large, say 5 or more, this type of 
stratification on the covariate is generally to be preferred to the analysis of 
covariance. Often neighboring ordered categories having small frequen- 
cies may be grouped in order to build up the cell frequencies. 

Unequal Cell Frequencies. If the number of subjects in each treatment 
class is not constant, the only changes in the computational formulas are as 
follows: 

2 2 
02-144. ну Bh gy Th 
n; n; n; 


(3x) = Х(3х) (xy) = Х(3ху)) (Зу) = Х(3у;) 
The degrees of freedom for this case are as follows: 


Source df 
Sy, (En) — 2 
Evy D -11-1 
Туу k-1 


With these changes, the computational procedures outlined in Table 11.3-1 
may be used. 
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11.4 Factorial Experiment 


The analysis of covariance for a factorial experimentis a direct generaliza- 
tion of the corresponding analysis for a single-factor experiment. Assum- 
ing a p x q factorial experiment having л observations in each cell, the 
model is as follows: 


Yj = Y, — ҚХу- X) =p + & + B, + «бу + 6. 


That is, an observation, adjusted for the effect of the covariate, estimates the 
parameters іп the usual analysis of variance. Ifthe covariate is ignored, its 
effect augments variation due to experimental error; the latter includes all 
uncontrolled sources of variation. It will be assumed throughout this 
section that A and B represent fixed factors. 

The observations on the covariate and the criterion within cell ар, of the 
experiment are represented as follows: 


x Y 
Хук Ук 

эк Үз» 
X, njk Уһ, 


In each of the pq cells in the experiment there are n pairs of observations. 
The following notation denotes the various sums needed in the analysis: 


АВ, = sum of covariate measures in cell ab;,. 

AB, = sum of criterion measures іп cell абу. 
A,, = sum of all covariate measures at level a;. 
А, = sum of all criterion measures at level а). 

В, = sum of all covariate measures at level by. 

sum of all criterion measures at level 5,. 

С, = sum of all covariate measures. 

G, — sum of all criterion measures. 


The means corresponding to the sets of sums defined above are obtained 
by dividing the respective sums by the number of experimental units 
over which the sum is taken. For example, 


> 
ll 


dz AB, A, _ б, 
AB, = FA Ne he б npq 


The variation of the covariate within cell ab;, is 
E У(Х — АВ, Y. 
i 


The corresponding variation of the criterion is 
Ең = ХҰ;» т АВ, )?. 
i 
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The covariation within this cell is 
Ex, = ХХ. — AB, XY, — AB,,,). 


From these data, the within-cell regression coefficient is 
Е, 


by, = 2, 
жеу 
There will be pg such regression coefficients. 
If the within-cell variations of both the covariate and criterion are homo- 
geneous, and if the within-cell covariations are homogeneous, then cor- 


responding sources of variation may be pooled. These pooled estimates of 
the within-cell sources of variation are 


En = EX(,, Ey = EX(E,,,), E, = EX(E,, ). 


The estimate of the within-cell regression coefficient based upon the pooled 
data is 
b= Ве, 
Е,, 
The pooled within-class regression line has the following form: 
Yin = (Xs. жү АВ.) us AB, 


Wik" 
The variation of the observations about this set of regression lines is 
E 
Ej, = XXX( Yi — Yin) с ісек, 
тт 
This source of variation provides an estimate of the variation due to experi- 
mental error in the analysis of covariance; its form is identical to the corre- 
sponding term in a single-factor experiment. И 
The variation due to the main effect of treatment A for the covariate is 
Az, = ng&X(A,, — бр. 
The corresponding variation for the criterion is 
Ay, = п4Х(А,, = G): 
The covariation associated with the main effect of factor A is 
А = nqX(A,, — G,)(4,, — G,). 


Just as in the case of a single-factor experiment, the adjusted variation 
due to main effects of factor А is obtained in a rather roundabout manner. 
One first combines the following sources of variation: 
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The residual variation corresponding to this last line is 


‚ Бү. An En) 
(A+ Bi, = Op gh EE. 
The adjusted variation for the main effect of factor A is 
; Кул А +Е)? Ез 
Ay = (AFE w Е,, Ay sr ER T т E 3 


The variation due to the main effect of factor В is given by a procedure 
analogous to that used in obtaining the main effect of factor A. Thus, 
By, = npX(B,, — Gs)» 
By = npX(B,, Mr GY; 
B,, = npX(B,, — б„)(В„ — 6.) 


| 


Similarly, 
ES E,, Ey 
Bs Buy By 


By + Еш В,, + Ezy By, T Ey 
The adjusted variation due to the main effect of factor B is 
(B, + En) | Еш 
BL = (BATE) ee EE IDEE SEMILLA 
vy ( 7 )w 172 уу Вы = ESS Е, 


The sources of variation associated with the AB interaction are defined as 


follows: 

AB yy = nZX(AB,, =x 4, 457 В,, rt Gy. 

AB,, = nEX(AB,, — Ay, — By + 6^ 

AB,, = nEX(AB,, — А — Ba, + САВ, — А, — B, + бу). 
To obtain the variation due to the interaction adjusted for the covariate, one 
first combines the following sources of variation: 


Exe Еу Evy 
АВ,» АВ, AB. _ 


АВ + Ж. АНЫ р Е, AB,, + Ey 


The adjusted variation for this last line is 
А (AByy + Е)? 
(AB ae Е), = (AB,, a Ey) Ж АВ; И ү 


The variation due to AB, adjusted for the covariate, is 
AB}, = (AB + Ej — Ew 
(АВ„ + En) | Em. 


SU (pases 
AB UT SET NEC 
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The analysis of variance is contrasted with the analysis of covariance in 
Table 11.4-1. Assuming all factors fixed, М5 то is the proper denominator 
for F tests on main effects and the interaction. An approximation to the 


Table 11.4-1 Summary of Analysis of Variance and Analysis of Covariance 


Analysis of variance Analysis of covariance 
Source - = 
SS df MS SS df | MS 
A Ay p-il MS, 4% p-1 | MS, 
B By, nc MS, Bi, 4-1 |MS 
AB AB, | (р-14-1) | MS, АВ), |(p — 1g — 1) MS, 
Error | E, pan — 1) MSeror | Ej | pq(n — 1) —1 MS¢rror 


adjusted sums of squares for the main effects and interactions uses an 
expression analogous to (1) in Sec. 11.2. Thus, 


Ay, = Ay, — 2bA,, + БА, 
By, = By, — 2bB,, si DB. 
AB,, = AB,, — 2bAB,, + AB, 


In cases where the treatments do not affect the covariate or in cases where 
bias due to the covariate is not large, the-approximations given above will 
be quite close to the estimates obtained by the somewhat more laborious 
procedures described earlier. 

The adjusted cell means have the following form: 


АВ ABR = b(AB,,, — АВ). 
The average effective error for the difference between two adjusted cell 
means is 
мша + 4В/(р = Dq — о]. 


n тт 


In making multiple comparisons between the cell means, the effective error 
per experimental unit is 


М4, [ + Sate te), 


тт 


The adjusted mean for level a; is 
А», = 4,, + b(À, — б). 
The variance of a difference between two adjusted means is 
| 1. a 
ng E. 
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In making multiple tests among (һе 4/6, the average effective error per 


experimental unit may be taken as 
MSbrror h 4 ce 


E, 


An adjusted mean for level b is 
В, = В, + v(B,, — G,). 


r 
The corresponding average experimental error per unit is 


В.(4- М); 
Еш 
In making а test for homogeneity of within-cell regression, the error 


term may be partitioned as follows: 
a N 


MSerror lı 213 


Source df 
Eyy = Ey — (ЕМЕ) pan — 1) 1 
5, = ES — DEER Еш) pain = 2) 
Sp = ХЕ, Еа) -(ЕМЕм) рд =! 


S, represents the variation of the observations about the individual within- 
cell regression lines. 5; represents the variation of the individual cell 
regression coefficients about the pooled within-class regression coefficient. 
The test for homogeneity of within-class regression uses the statistic 


_ Sypa = 1) 
S,/pq(n — 2) 

It should be noted that the pooled within-class regression coefficient is 
used in obtaining 4,, Bj, and AB, Implicitin this adjustment process is the 
assumption that b is the appropriate regression coefficient for all these 
adjustments. In most experimental situations in the behavioral sciences 
this assumption very probably oversimplifies what is a more complex 
model. Caution in the use of covariates in factorial experiments 1s 
advised, The model for the analysis outlined in this section 1s a highly 
restrictive one. The experimenter should be aware that such designs 
exist, but he also should be aware of both their strengths and their weak- 


nesses. 


11.5 Computational Procedures for Factorial Experiment 
С tational procedures for the analysis of covariance for a p x q 
арат п observations per cell will be considered first. 


factorial experiment having у j II be ‹ ! i 
The case of unequal cell frequencies is considered later in this section, where 


600 


the equivalent of an unweighted-means analysis is outlined. Procedures 
in this latter case require relatively small changes from those to be given for 
the case of equal cell frequencies. 

Assuming п pairs of observations in each of the pq cells, computational 
formulas for the sums of squares needed in the analysis of covariance are 
given in Table 11.5-1. In each case the range of summation is over all 
possible values of the total that is squared and summed. Symbols (1x) 


ANALYSIS OF COVARIANCE 


Table 11.5-1 Computational Formulas for the Analysis of Covariance 
in a Factorial Experiment 


(1х) = Gz/npq 


onde i (Ixy) = G,G,/npq 
yes 


(y) = Gi/npq 
(ху) = ZXY 


оу = XY? 


G (3) -(хА9/м 
(4х) = (ХВ2/пр 
(5х) = (ZAB?)/n 


Аа = (3x) — (1x) 

Ва = (4x) — (1х) 

(ii) ABa = (5x) — (3x) 
— (4x) + (1x) 

Ex, = (2х) — (Sx) 


(3xy) = (2A,A,)/ng 
(ху) = (2B,B,)/np 
(5ху) = (EAB,AB,)In 


An = (xy) — (Ixy) 
В. = (4ху) — (Ixy) 
АВ, = (5ху) — (3xy) 

— (4xy) + (1ху) 
= (ху) — (5ху) 


Е,-Е,-(ЕМЕ,) 


Еш, 


2 
(4 + Ey, = (Ayy + E) — Gov + Е)? 


(iii) 


B. 2 
(8+ Юу, = By + En) — Ga t Бы” 


Аш + ES 


B 


ae + Bu 


A E 2 
(AB ЕУ, = (AByy + Ej) — (Вы + En) 


through (5x) are those used in 


(АВ. + ЕЛ)? 


(3y) = (2AD/ng 
(Ау) = (2B})/np 
(5y) = (SAB) |n 


Ay = (3y) - (1y) 
By, = (4y) — (1у) 
AB,, = (5y) — (Зу) 
— (4y) + (1y) 
E, = (2у) — Gy) 


Ayy =(A+ Буу "d Ey, 


Byy = (В + E, — Ey 


AB), = (AB + E), — Ely 


an analysis of variance on the covariate. 


Symbols (1y) through (5y) are those used in the usual analysis of variance 


on the criterion. 


covariances needed in the adjustment process. 
A 2 x 3 factorial experiment having n = 5 observations per cell will be 
used to illustrate the computational procedures. Suppose that the p = 2 


levels of factor A represent me 
and suppose that the q = 
For purposes of the prese 
fixed. The covariate me 
ment test on map readin 
Score on a comparable form of the achieve 
pleted. Assume that intact groups of n = 
random to the cells of the experiment. 

this experiment are those given in the up 


£ prior to the trainin 


Symbols (1xy) through (5xy) are used to estimate the 


thods of instructing in teaching map reading, 


3 levels of factor B represent three instructors. 


nt analysis, both factors will be considered to be 
asure in this experiment is the score on an achieve- 
g; the criterion measure is the 
ment test after training is com- 
5 subjects each are assigned at 
Suppose that data obtained from 
per part of Table 11.5-2. 
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Table 11.5-2 Numerical Example 


eet a ————— 


by ЖЕНЕ 
Instr. 1 Instr.2 | Instr. 3 

Method | — - 
x,- а Е о АЕС Ар 
ПЕЕ | 50 90 
| 35 80 | 40 100 | 40 85 
a 40 95 | 45 85 | 40 90 
so 105 | 40 90 | 30 80 
45 100 | 40 90 | 40 85 
50 100 | 50 100 | 45 95 
30 95 | 30 90 |30 85 
a 35. 95 | 40 95 | 25 75 
4s 110 | 45 90 50 105 
30 88 | 40 95 | 35 85 


AB summary: 
Nc 


| bs | Total 
х > | Subs Ы Ж 

| 

| 


а, | 210 475 | 195 450 
a, |190. 48814/1205. _ 410 
Гао 963 | 400 920 


200 430 | 605 1355 
185 445 | 580 1403 


—385 875 | 1185 2758 


(1x) = 46,808 (1xy) — 108,941 (ly) = 253,552 
(2x) = 48,325 (2xy) = 110,065 (2y) = 255,444 
(3x) = 46,828 (xy) = 108,901 (Зу) = 253,629 
(4x) = 46,822 (4xy) = 109,008 (4y) = 253,939 
(5x) = 46,895 (Sxy) = 108,979 (Sy) = 254,019 


olas DONI IU 


ect under method а, and instructor 


In this table, for example, the first subj truc 
on the covariate and the criterion. 


b, has scores of 30 and 85, respectively, 
In symbols, 
Ха-3, Yin = 85. 


An AB summary table appears under the observed data. There are two 
entries in each cell of this table—one represents the sum of the observations 
For example, the sum 


on the covariate, the other the sum for the criterion. he st 
of the n = 5 observations on the covariate under treatment combination 


aby, is 
AB,,, = 30 + 40 + 45 + 40 + 40 = 195. 


The corresponding sum for the criterion data is 


АВ, = 85 + 100 + 85 + 90 + 90 = 450. 
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The entries in the total columns at the right of the AB summary table are the 
sums of corresponding entries in the rows. For example, 


As, = ZAB., = 210 + 195 + 200 = 605. 


The corresponding sum for the criterion data is 


А, = АВ, = 475 + 450 + 430 = 1355, 
k 


The total of the first column in the summary table is 
В, = УАВ, = 210 + 190 = 400. 
j 


The corresponding sum for the criterion is 


В, = 248,, = 475 + 488 = 963. 


The grand totals for the covariate and the criterion are 
G, = ZA, = УВ, = 1185, 
G, = XA, + УВ, = 2758. 


The computational symbols in the lower part of Table 11.5-2 are defined 
in part i of Table 11.5-І. The only symbols requiring special comment are 
those in the center column. These entries are obtained as follows: 


(1185)(2758) 
1ху) = 392A 2/29) 
(Ixy) 30 > 


Qxy) = (40)(95) + (35)(80) + -.. +. (50)(105) + (35)(85), 
(xy) (605)(1355) + (580)(1403) 


15 
(4xy) = (400)(963) + (400)(920) + (385)(875) 
10 1 
(5ху) = (210)(475) + (190)(488) + -<< 4 (185)(445) 
= - 


The basic data for all of these symbols except (2xy) are obtained from the 
AB summary table, 

Sums of squares and sums of products are given in Table 11.5-3. Com- 
putational formulas for these terms are given in parts ii and iii of Table 
11.5-1. Note that it is possible for the entries that are used to obtain co- 


negative (—40). Inspection of the total columns at the right of the AB 
Summary in Table 11.5-2 indicates that the higher criterion total is paired 
with the lower covariate total; hence the negative covariation. 
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Table 11.5-3 Summary Data for Numerical Example 


Аы = 2l Аы = —40 Ay = 71 
Вы = 15 Byy = 67 wy = 387 
AB, = 52 AB, = M ABy = 3 
Еш = 1430 Е,, = 1086 Ey, = 1425 
1518 1124 1892 


Ej, = 1425 — (1086/1430) 

= 600 
(А + ЕУ), = 748 Aly, = 748 — 600 = 148 
(B + E), = 892 Bj, = 892 — 600 = 292 
(AB + ЕУ, = 616 ABl, = 616 — 600 = 16 


——— е ———= 


The analysis of variance for the criterion data is summarized in Table 
11.5-4. This analysis disregards the presence of the covariate. Dif- 
ferences between the methods of training are tested by means of the statistic 


Ree кча), 


59.4 


(The instructor factor is considered to be fixed.) This test indicates no 
statistically significant difference between the methods in so far as the mean 


of the groups is concerned. 


Table 11.5-4 Analysis of Variance 


Source 55 df MS F 
A Methods Ay = 71 1 77 1.30 
B Instructors By = 387 2 193.5 3.26 
AB ABy = 3 2 1.5 

Error Ep = 1425 24 59.4 

Total 1892 29 


Mesum [i ee 
F (2,24) = 3.40 


The analysis of covariance is summarized in Table 11.5-5. Мое that the 
error mean square in this case is 26. 1, compared with 59.4 in the case of the 
analysis of variance. Further note that the adjusted method mean square 
is 148, compared with 77 іп the analysis of variance. This increase in the 
adjusted method variance is a function of the negative covariance for the 
between-method totals. A .05-level test on. the methods in the analysis of 
covariance indicates statistically significant differences between the criterion 
means. Thus, when a linear adjustment is made for the effect of variation 
due to differences in prior experience in map reading, as measured by the 
covariate, there are statistically significant differences between the training 


methods. 
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Table 11.5-5 Analysis of Covariance 


Source SS df MS | F 
A Methods Al, = 148 1 148.0 | 5.67 
B Instructors В, = 292 2 146.0 | 5.59 
AB АВ, = 16 2 8.0 
Error Ej, = 600 23 261 | 
28 | 


Е,(1,23) = 4.28; F4,2,23) = 3.44 
An estimate of the square of the within-cell correlation is 
КБ 


уу 


The mean square due to experimental error in the analysis of covariance is 
approximatel 
i D Mire о... 


The adjusted criterion means for factor A are given in Table 11.5-6. 
Note that the difference between the adjusted means is larger than the 


= .58. 


2 
"within = 


Table 11.5-6 Adjusted Means 
b = ЕЕ» = Л6 


| Method 1 | Method 2 Mean 
AE 40.3 38.7 39.5 = б, 
Az, - б, ij -.8 r 
“+ DNE 90.3 93.5 91.9 — G, 
A, = A, — .76(4,, — б.) 89.7 94.1 91.9 


corresponding difference between the unadjusted means. Had the co- 
Variance in this case been positive rather than negative, the difference 
between the adjusted means would have been smaller rather than larger 
than the difference between the unadjusted means. 

It is of interest to compare the adjusted mean squares given in Table 
11.5-5 with those that would be obtained by the approximation method 
described in the last section, The latter mean squares are as follows: 


SS eee 


SS MS 


Ay = Ay — А, + БА, 
= 77 — 2(76(-40) + (76921) 150 150 
By = By, — 25B,, — BB» 309 154 
AB, = AByy — 2bAB,, — BAB, 16 8 
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These mean squares are slightly larger than the corresponding mean 
squares in Table 11.5-5. 

Unequal Cell Frequencies. Under conditions in which an unweighted- 
means analysis is appropriate for an analysis of variance, there is an equiv- 
alent unweighted-means analysis appropriate for the analysis of covariance. 
With the definitions of the computational symbols given in Table 11.5-7, the 
procedures outlined in Table 11.5-1 may be used to obtain an unweighted- 
means analysis of covariance. 


Table 11.5-7 Computational Formulas (Unequal Cell Frequencies) 


Eua ooo ee eee 
(1x) = ӨЗІМ (Ixy) = G,G,|N (y) = G/N 
(2x) = ХХ? (xy) = EXY (2у) = XY? 
(3х) = (543) (ху) = (24,41) Gy) = EAR 
(4x) = йр(®В) (4ху) = Яр(®В„В,) (4y) = лр(®В) 
(5х) = AŒAB?) (5xy) = a(2AB,AB,) (5y) = ЖУАВ) 


In this analysis it is assumed that the number of observations in each 
of the pq cells is approximately л, the harmonic mean of the cell fre- 


quencies. The latter is 
Pq 


УУ(1[п)` 


The total number of observations in the experiment is designated N. То 
avoid excessive rounding error, several decimal places should be carried on 
each of the means during the intermediate computational stages. The 
computational symbols defined in Table 1 1.5-7 are algebraically equivalent 
to those in part i of Table 11.5-1 when there are equal cell frequencies. 

Illustrative Applications. Learning and conditioning experiments have 
made relatively extensive use of the analysis of covariance. Prokasy, 
Grant, and Myers (1958) report a typical application of the analysis of 
covariance in this context. The purpose of their experiment was to study 
the effect of stimulus intensity (four levels) and intertrial interval (three 
levels) upon acquisition and extinction of eyelid UR RUE The ori- 
terion was the proportion of conditioned responses in units of the nou 
transformation. In the acquisition phase of the study, the score on the 


first day's trials served as the covariate for analyzing the results of the 


second day's trials. In the extinction phase, the combined scores during 


the first and second days’ trials during acquisition served as the opire 
The analysis of covariance during the acquisition phase ya е 2 
clarify interpretation of the trend of the effects. It was found tha E 
intertrial interval had an effect during day 1, and the effect с 
during day 2; however, the latter effect was almost entirely gine le 
from that observed on day 1. Zimbardo and Miller (1958) EX n 
experiment in which covariance analysis is employed for essentially the 


same purpose. 


i= 
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Payne (1958) reports an experiment that is a special case of a 2 x 2x2 
factorial experiment. The plan of this experiment may be represented as 


fol OW ys gee ea eel i, 


| 
Method of initial | Method of | раш 
learning relearning | F ^ 
1 2 
ТЕ es 
a bı | Gin Gus 
2 bs Cis Сүз 
а, ы Co; Сз» 
2 by Gon Сы 


The covariate in this study was the logarithm of the number of trials re- 
quired for initial learning; the criterion was the logarithm of the number of 
trials required for relearning. For example, the subjects in group Gig 
learned initially under method а). Relearning was done under method 5; 
while the subjects were under the influence of drugc, The use of covari- 
ance analysis in this case is to adjust the relearning data for the linear 
effect of the initial learning. 

Cotton, Lewis, and Metzger (1958) report an experiment which is à 
3 x2 x 2 factorial. The type of apparatus in which a behavior pattern 
was acquired defined one factor. A second factor was defined by the type 
of apparatus in which the behavior pattern was extinguished. The time of 
restriction in a goal box defined the third factor. The covariate for the 
extinction data was the score on the last five trials of the acquisition phase. 


11.6 Factorial Experiment—Repeated Measures 


The design to be considered in this section is the analogue of the split-plot 
design in agricultural research. The design may be represented schemati- 
cally as shown at the left of Table 11.6-1. Assume that there are n sub- 
jects in each of the groups. At the right is the usual analysis of variance 
for this design when there is no covariate. 


Table 11.6-1 Factorial Experiment, Repeated Measures 


Source 55 df 
есім айу b, Between subjects | np —1 
OT ID RS NND mai A A p-1 
&|*@ PG RORIS f w TH 
8 б, б c Subj w. gp | p po — 1) 
: Within subjects npg —1) 
| B Er m 
AB | AB, (p — 1)(@ — 
а 6), 6) ::- С, Residual | Ey pq = Da —1 
| ee sal 
ee Se |! a 
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Adjustment procedures depend upon whether or not the between- and 
within-subject regression coefficients can be considered homogeneous. 
In the first part of the discussion that follows these regression coefficients are 
not assumed homogeneous. 

Aside from the matter of homogeneity of the between- and within-sub- 
ject regressions, two cases of this design need to be distinguished. In case 
(1) there is a single covariate measure associated with all the criterion scores 
for an individual. The data for subject i may be represented as follows: 


| b, DN с Eb; 


Subject i | X; S, Ya "EY, 
Here the covariate score X, is a measure taken before the administration of 
any of the treatments. Hence the same JY, is paired with all criterion scores 
on subject 7. In contrast, for case (2) the covariate measure is taken just 
before, just after, or simultaneously with the criterion measure. The data 
for subject / in this case may be represented as follows: 


| ы by эз b, 


Subject i | Ха Ya Ха Yami the Ха 


Thus for case (2) each criterion measure on subject / is paired with a unique 
covariate measure. Case (1) may be considered as a special case of case (2) 
in which all the X;/'s for subject i are equal. Hence computational pro- 
cedures for case (2) may be used for case (1). | 
Under case (2), both the between-subject (whole-plot) comparisons and 
the within-subject (split-plot) comparisons are adjusted for the effect of the 
covariate. Under case (1), only the between-subject (whole-plot) com- 
parisons are adjusted for the effect of the covariate—the within-subject 
(split-plot) comparisons will all have adjustments which are numerically 
equal to zero. 
he notation that will be usedis essentially that defined in Sec. 11.4. Two 
additional symbols are required. Pu 
P, = sum of the 4 observations on the covariate for subject i in group G, 
P,” = sum of the q observations on the criterion for subject i in group б, 
d covariations аге associated with differences 


The following variations an 
between subjects within the groups: 
Р. = EXP — Аа)" ы 
Poy = EEP rin — АХВ, — А) 
Pp = ФУР, — А): 


The linear regression equation for the subjects within-group data has the 


following form: 2. 
Б -Ы А) + А» 


мо) 


= мм 


= oy т . = “ор sm 
10192 0] 
:3I0N 


Tenba oq prm sazenbs 10 sums Зшмоцоу ou *;j»ofqns UaAIS в uo $әлоэ$ Це Цим Paivloosse St onseour oyprivAO2 ajSurs v J] 


I— (I — u(t — 5d Е Кассе fen “q TET тепріѕәу 
$ 
( — byt — d) "d — cs Y ET — ("7 + "gy) = "gy "gy "gy "gy gr 
[3 

тт, + 627 c 
T= “4- rs x 25 (8) Аб “я "g 9 a 
1= (1 = ud = dig) лас “4 “4 “4 d3 “ж fang 
ip іре то de Cet te Ep) m agn A SY TE [4 

2 
JP uonenea рәзѕпіру | z4 AX iX SHES 


ІСӘТІ QEL ur u3isoq 207 sispeuy SOUBHEAOD Z-9'TE AQEL 
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where b, is given by 
2а, 
Pus 
The variation of the residuals about this regression line is 


Ру = Ру С". 


The mean square corresponding to this latter variation is the adjusted error 
for between-subject effects. 

The variations and covariations associated with the treatment effects are 
estimated by the same procedures as those used in Sec. 11.4. The error 
terms for within-subject effects in this design have the following form: 

Residual (x) = within cell (x) — P; 


Thus Exe = У(Х — ABay)* — Paw 

Egy = UX in Yin т” (AB,,,(AB,,,)] = Pay 

Ey = ЖҰ» — АВ, )* — Py 
The regression coefficient for the within-subject effects is given by 

b=. 
E, 

The residuals about the regression line for the within-subject effects have 
variation equal to в 
Ev = Ey — r» ‹ 


An outline of the analysis of covariance is given in Table 11.6-2. 
Computational procedures for this design differ only slightly from those 
given in Table 11.5-1. One additional set of computational symbols is 


required for the case of repeated measures. 
(EP) 
y=, (ы) 00, =" 
4 


ME. ; T A 
In each case the summation is over all possible values of the totals. 
more complete notation for these totals is Pan» each total being garg 
observations. In terms of the symbols defined in Table 11.5-1 and the 


set of symbols defined here, 


Р = (6x) — (3x), 

Egg = (2x) — (5x) — (6x) + (1x), 
Р, = (бу) — (ЗУ), 
E, = (2y) — Gy) — (бу) + (15). 


Р„, = (6xy) — (3ху), 
E,, = (2ху) — (xy) - (6xy) + (Ixy), 
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The variation associated with the treatment effects is defined in Table 1 1.5-1. 
The adjusted mean for level а; has the form 


Ay, = Ay, — b,( Az, — Ga): 
The effective error variance for a difference between two adjusted means in 
this case is = e 
[s p Gu a 


RT 


ng Р 


p 
where sj is the adjusted between-subject mean square for error. An 
adjusted mean for level b, has the form 

Bj, = В, — b(B,, — G,). 


The effective error variance for the difference between two adjusted means 
in this case is " 21 
2 [2 (Ban = 2| 
S| — + а-ы |, 
np E 


where sj, is the adjusted within-subject mean square for error. 
An adjusted cell mean is given by 


AB, b,(4,, — G,) — b,(AB, 


Vik Tik 


АВ, A,,). 
The effective error for the difference between two adjusted cell means which 
are at the same level of factor A is estimated by 
"E zh (АВ,,, = АВЫ, 
zx 
The difference between two adjusted cell means which are not at the same 
level of factor A has the following form: 


АВ, 4B,,,, eed AB,.. 5,4, Aan) 
b(AB,,, — 4, — АВ, 


NEU 


Since this difference involves both between- and within-subject effects, the 
effective error variance is Somewhat more complex. The latter variance is 


estimated by 
19 +@— Di, (4,- 4.74 (Ав — 4, — AB 
T 
nq Poz ј Eve 
If the regression coefficients for between-subject (8,) and within-subject 
(8) effects аге equal, the within-subject regression may be used throughout 
inmaking the adjustments. When the treatments do not affect the covari- 
ate, it 15 reasonable to expect that the between- and within-subject regres- 
sions will be equal. A test on the hypothesis that 2, = 5, is given by 
ОЛЕШЕ, 


t= 2, 
2 2 
V2 +з 


+ 4,0% 


ms 
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where sj = 52/Р,, and 8-5 Е,. Sj and sj are the respective error 
variances for b, and b,. Since the variances in the denominator of this 
t' statistic will not in general be homogeneous, the sampling distribution of 
1 is not that of the usual / statistic. If the degrees of freedom for s; and sj, 
are both larger than 20, the normal distribution N(0,1) may be used to 
approximate the sampling distribution of f^. In other cases the sampling 
distribution of t’ may be approximated by the usual / distribution with 
degrees of freedom f, "TTL 


HS) + Gf)" 


where f, and f, are the respective degrees of freedom for 52 and s;. 

When it can be assumed that f, = w, the analysis of covariance has the 
form given in Table 11.6-3. All the adjustments for within-subject effects 
are identical to those given in Table 11.6-2. The adjustment procedures for 
the between-subject effects are, however, different. For purposes of 
making over-all tests, there is some indication that the adjustments for 
between-subject effects given at the bottom of this table are to be preferred 
to those indicated at the top. The two adjustment procedures are not 
algebraically equivalent. , 

Adjusted means for within-subject effects are identical to those given 
earlier in this section. The adjusted mean for level a; now has the form 


Aj, = А, — ba, — Ge): 
The error variance for the difference between 4), and А, in this case is 


fe 


approximately S d 4i ssa y 
<< ту зт) | 
a Е, 


An adjusted cell mean in this case is 
AB;, = AB,, — БВ, — G2. 
The error variance for the difference between two adjusted cell means which 
are at the same level of factor A is identical to that given earlier in this 
section—this difference is a within-subject effect. The error variance for 
the difference between two adjusted cell means which are not at the same 
level of A is approximately 
2[5/° + (4 — Ds] E (AB, — AB, Se 
nq Еш 

The data in Table 11.6-4 will be used to illustrate 
for this design. Disregarding the covariate, 
part i represents data that would be obtained in a 2 x 2 factorial experi- 
ment having repeated measures on factor B. There are four subjects under 
each level of factor A. Suppose that the covariate measure on a given sub- 
ject is obtained before the administration of any of the treatments, then the 


Numerical Example. 
the computational procedures 


а 
zs (I— d ded + "д" a = My ү "^ [qng 
Tad “УЕ "үйд Se Мр 
3 
25 І-(1-ш(1- 5d LIO (169) ^ч ttg Tenpisoy 
а— (т-д) сда (“я тау)” OF + ar) = “ay gr 
(onm 
тт, + аг 
D а- ud “4- 7e E = x ay Ey d3 m [ang 
guo epe 
1554 "a-g C + MY) = My 
SW JP uonenea рәјѕпіру amog 


“# = "d UUM 55 бешу әзививло) є-ө-трәдер 
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covariate measure for a given subject will be a constant for both levels of 
factor B. Thus the data in Table 11.6-4 represent case (1) of this design. 


Table 11.6-4 Numerical Example 
—— ununi fec Бае аа 


by by Total 
Person 
x Y x x xX Y 
1 3 10 3 8 6 18 
a 2 5 15 5 12 10 27 
3 8 20 8 14 16 34 
ы 4 2 12 2 6 4 18 
G) 
5 1 15 1 10 2 25 
а 6 8 25 8 20 16 45 
7 10 20 10 15 20 35 
8 Des Отона foa A 
39 7 132 39 95 TS 201 
l + аш eS eS LES 
by ba Total 
X Y x Т. x Е 
(ii) a 18 57 18 40 36 97 
а 21 75 21 55 42 19. 
9 132 | 9 95 в 227 
(1х) = 380.25 (1xy) = 1106.62 (1) = 3220.56 
(2x) = 542 (2xy) = 1282 (2y) = 3609 
(iii) (Зх) = 382.50 (3xy) = 1119.00 (Зу) - 3288.62 
(4x) = 380.25 (4xy) = 1106.62 (4y) = 3306.12 
(5x) — 382.50 (Sxy) = 1119.00 (Sy) = 3374.75 
(бх) = 542.00 (бху) = 1282.00 (бу) = 3516.50 
An AB summary table for both the variate and covariate appears in part 


ii. With the exception of symbols (62), (6xy), and (6y), the computational 
symbols given in part iii are defined in part i of Table 11.5-1. Symbols con- 
taining the number (6) were defined earlier in this section. Not all the 
symbols in part iii are required in the analysis of data for case (1), since 
within-subject (split-plot) adjustments will be zero; to show, however, that 
such adjustments will be zero, the complete analysis will be illustrated. 
The computation of the variation for the between- and within-subject 
effects is illustrated in Table 11.6-5. By inserting x, ху, or y in the symbols 
given at the left of part i, one obtains the variations given under the headings 


X*, ХҮ, and Y?, respectively. Рог example, 
Age = (3x) — (1х) = 2.25, 
An = Qxy) — (Ixy) = 12.38, 
Ay = Gy) — (Ly) = 68.06. 
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The adjusted value for the between-subject error term is given by 
Peer 163.00? 


t 227.88 — 61.30. 
Р 159.50 


The adjusted value for the variation due to the main effect of factor A is 
given by 
(А„ + Pay)” 


Aj, = (Ay +P 
vu (Ay 4 Азр. 


w) Р,, 


5 | 2 

= (68.06 + 227.88) — (1238 + 163.00)" 040, 
2.25 + 159.50 

= 44.48, 


Table 11.6-5 Analysis of Variance and Covariance 


x? XY y? (YF 


A (3) — (1) 2.25 12.38 68.06 44.48 
Subj w. 4 (6) — (3) | 159.50 163.00 227.88 61.30 
@ В (0—-()0| 0 0 85.56 
AB (5) — (3) — (4) + (1) 0 0 .57 
Residual (2) — (5) — (6) + (3) 0 0 6.37 
Source of variation 55 df MS | F 
A 68.06 1 68.06 1.79 
Subj w. А 227.88 6 37.98 
(ii) B 85.56 1 85.56 80.72 
AB eye 1 57 
Residual 6.37 6 1.06 
A (adj) 44.48 1 44.48 3.63 
Subj w. А (adj) 61.30 5] 12.26 


The analysis of Variance as well as the analysis of covariance are sum- 
marized in part ii of Table 11.6.5, Only the between-subject (whole-plot) 
comparisons will have nonzero adjustments. If one were to compute the 


adjusted value for the main effect due to factor B, the adjusted value would 
be 


ДЕ Ba Е)? " 
By, = (By, + E,,) — иа БЕ 


(ус 2 
= (85.56 + 6.37) — 6-9 — 6.37 = 8556 = B,,. 


That these adjustments should be zero follows from the fact that the co- 
variate measure is constant for all criterion measures on the same subject. 
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It should be noted that the error mean square for the between-subject 
(whole-plot) comparisons is 37.98 in the analysis of variance. Intheanaly- 
sis of covariance the corresponding error mean square is 12.26. Hence the 
covariate adjustment reduces the error mean square for between-subject 
comparisons from 37,98 to 12.26. This relatively large reduction in the 
error mean square is a function of the magnitude of the within-class 
correlation. The square of this latter correlation is estimated by 
ЕК5 


w 
vy 


The mean square due to the main effect of factor A is reduced from 68.06 
to 44.48 in the adjustment process. 

Under the analysis of variance, the F ratio for the test on the main effect 
of factor A is 1.79, with degrees of freedom 1 and 6; the corresponding F 
ratio under the analysis of covariance is 3.63, with degrees of freedom 1 
and 5. 

Another Numerical Example. The data in Table 11.6-6 will be used to 
illustrate a design in which there are distinct covariate measures for each 
observation, The design in this table may be considered as a special case of 


Table 11.6-6 Numerical Example 
LLL ac oo Toe Ml a араан аа 


b, bs Total 
Subject 
d x Y 2 Y x D 
1 3 8 4 14 7 22 
a 2 5 11 9 18 14 29 
3 11 16 14 22 25 38 
19 35 27 54 46 89 
2 6 1 8 3 14 
а E 8 12 9 14 17 26 
; 6 10 9 9 10 19 19 
20 27 19 32 39 59 
11 20 
7 7 10 4 10 

а 8 8 14 10 18 18 32 
` 9 9 15 12 22 21 37 
24 39 26 50 50 89 
Total 63 101 72 136 135 237 
(1х) = 1012.50 (1ху) = 1777.50 ay) = ee 
(2x) = 1233 (2xy) = 2004 (2у) = кт 
(3x) = 1022.83 (3xy) = 1807.50 Gy) = ee 
(4x) = 1017.00 (4ху) = 1795.00 (4y) E 
(5x) — 1034.33 (5ху) = 1835.67 о) Е р 

(бх) = 1207.50 (6xy) = 1964.00 (6: 1. 
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a 3 x 2 factorial experiment with repeated measures on factor B. There 
are n = 3 subjects in each group. 

Computational symbols obtained in part ii are defined in part i of Table 
11.5-1. Symbol (6) has the following definition: 


(6x) - ОР) PHI +... p 19? 4 2° 


1207.50, 
q 2 
2 Le ae 
(xy) CEP) TOA ап + + 21(37) 1964.00, 
q 2 

Ур? 2 2 А 2 2 
(6y) (ӘР) 9228.2 20° 45 : + 32? + 37 3397.50. 

q 


The sums of squares and sums of products for the variations and covaria- 
tions are summarized in part i of Table 11.6-7. The analysis of variance 
is summarized in part ii. The analysis of covariance is summarized in 


Table 11.6-7 Analysis of Variance and Covariance 
ee a at ы - 


х? ХҮ у? 


a 25 ША 
A (3) — (1) 10.33 30.00 100.00 
Subj w. А (6) — (3) | 184.67 156.50 177.00 
Q B (4) — (1) 450 17.50 68.06 
АВ (5) — (3) — (4) + (1) 7.00 1067 1644 
Bex (subj w. A) (2) — (5) — (6) + (3) 14.00 11.83 13.00 
Total (2) — (1) | 220.50 226.50 374.50 
Source БӘЙГІ mance MS F 
Between subjects 
Iu аланы! 
A 100.00 2 50.00 1.69 
Subj w. А 177.00 6 29.50 
(ii) Within Subjects 
B 68.06 1 68.06 47.26 
АВ 16.44 2 8.22 5.71 
Bx (subj w. А) 13.00 9 1.44 
Source (adjusted) SS df MS jg 
Between subjects 
pas EUM CNS 
A’ ү 54.26 2 27.13 3.06 
Subj w. А ЄР) 44.37 5 8.87 
(іі) ^ Within Subjects 
pes epi 
B' t 31.56 1 31.56 84.16 
АВ н 2.33 2 1.16 3.09 
В x (subj w. А) (Е;,) 3.00 8 0.375 


Sn 252 ры MC DR 
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partiii. The adjusted variation is defined in Table 11.6-2. For example, 


2 2 
Ej, = E, =e доре 500) 
7 B,, + Ej. д 
By = (By + Ey) CREE, 
2 
= (68.06 - 13:00) — 1501-1483), 09 3156, 
4,50 + 14.00 


The between-subject effects in part iii are also adjusted in accordance 
with the procedures given in Table 11.62. Тһе estimates of b, and b, are 


Ры. 9600-20 


2 Төрім 1846731 
En — 1.83 _ шу 
E,, 1400 


PT 


In this case, inspection indicates that Б, may be used throughout in the 
adjustment process. If the adjustment of the between-subject effects is 
made in accordance with procedures given at the bottom of Table 11.6-3, 
the adjusted sums of squares are as follows: 
Aly = Ау — 22,4 + РА 
= 100.00 — 2(.845)(30.00) -- (.845)?(10.33) 
= 56.68, 
Pj, = Py — 2byPoy + ӘР, 
= 177.00 — 2(.845)(156.50) + (.845)°(184.67) 
= 44,36. 


The over-all F test for the hypothesis o; = 0 has the form 


— Anilp —1) _ 36:68/2 _ 3.93, 
Pulp —1) 4436/6 


It is noted that the error term for the within-subject effects is 1.44 in 
the analysis of variance and 0.375 in the analysis of covariance. d 
the between-subject effects the corresponding error terms are 29.50 an 
Е zm i К H CT. H 

eade о. Suppose that an experimenter 1s che кача 
evaluating the effect of two drugs upon blood pressure. Suppose ies er 
that the experimenter desires to evaluate effect upon blood ен w. ж 
а linear adjustment is made for the effect of the drugs соора е. 
The covariate, pulse rate, is in part affected by the drugs. In the ШЕ, З 
random sample of 2n subjects is divided at random into two groups, each o 
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sizen. Subjects in group 1 receive drug 1 first. After the drug has had a 
chance to act, blood pressure and pulse rate are taken on each subject. 
After the effect of drug 1 has dissipated, subjects in group 1 are given drug 2. 
Blood pressure and pulse rate are then obtained under the latter drug, 
Subjects in group 2 are given the drugs in the reverse order. The design 
may be represented schematically as follows: 


| Drug 


Бы ОЙ 


а 2 б; 


The analysis of covariance for this design will permit the experimenter to 
evaluate the effect of the drugs upon blood pressure after a linear adjustment 
has been made for the effect of the drugs upon pulse rate. In this type of 
experiment, one or more levels of the drug factor may represent control 
conditions, 


11.7 Multiple Covariates 


The principles developed for the case of a single covariate generalize to 
more than one covariate. In this section this generalization will be con- 
sidered in the setting of a single-factor experiment. However, multiple 
Covariates may be used with any of the experimental designs for which there 
I$ a corresponding analysis of variance. 

Suppose that there are two covariates X and Z, and a single criterion Y. 
The observations under treatment J in a single-factor experiment may be 
represented as follows: 


Treatment j 


Xy Zy Y 
Xy Zu Ys 
Xi Zi Y 
Xy Lins Y, 


In the analysis of covariance, the criterion data are adjusted for the linear 
effects of both XY and Z through use of a multiple regression equation. 
AA multiple regression equation obtained from the data in class / has the 
orm 


Y= 66) LES Xj) T bs (Zi Жм 2) BE Y, 
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If the within-class regressions are homogeneous, pooled within-class regres- 
sion coefficients may be computed. The multiple regression equation in 
terms of the pooled within-class regression coefficients has the form 

Yj = b. (X — X) bz — Z) + Y, 


ij 


The pooled within-class variances and covariances are defined as follows: 


E,, = ZXQG; — Хуу, E,, = (2, — 2)», 
E, = EX(X;— XY; га Y, Е. = ZX(Z,— ZY; ve. Y), 
Ey = ZX(Y, — Y, Ey, = У(Х — X3 Z;; - 2). 


In terms of these variances and covariances, the pooled within-class regres- 


sion coefficients are given by 
bee E,,E ay ys EE; ^ 

vr d 
р. Енш + Book sy 

yz d 
where d = E,E, — ЕЗ. The sum of the squares of the residuals about 
this regression line is 

E! = Ey — byzEzy — Фан 


The treatment variations and covariations are defined as follows: 


Tas = nX(T,, — G,)*, Ти = пХ Ж. б.у, 
T,-n2XT,—6GXf,— 6). Ty = (Te, — GXT, — 6). 
Ty = nX(T,, in @,)?, Т, T nTa д бәт, т 6). 


To obtain the adjusted treatment sum of squares, one combines correspond- 
ing treatment and error variations and covariations, computes a ш 
regression equation for the combined data, and computes the variation о 
the residuals about this equation. Тһе adjusted treatment variation Is ob- 
tained by subtracting the adjusted error variation from the variation of the 
residuals in the latter regression equation. The combined variations an 
covariations are as follows: 

Say = Tat Е,» 

5, = T,, + E, 


ombined data has 


б=т Ebo Sac Feo 
буу = Ty + Ep 5. = Ты + Em 


The multiple regression equation associated with these c 
the following form: ne eni 
y; = bu — X) + ya — 2) + У. 

eriment, this regression line is the one that 


For the case of a single-factor exp disregarded and an over-all 


would be obtained if the treatment classes were 
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least-squares fit obtained for the entire set of experimental data. The 
regression coefficients are given by 


where d" = 5..5: — S 
The sum of the squares of the residuals about this regression line is 
Siw = (T+ E, = S,, 5; DTS sy 
The adjusted sum of Squares for treatments is 
dS (ОЕ) Ей 


The analysis of covariance for this type of experiment is summarized in 
Table 11.7-l. Since two regression coefficients are estimated from the 


шу 


Table 11.7-1 Analysis of Covariance, Two Covariates 


Source SS df MS E 
MStrea 
Treatments T k—1 М. | F= стеле 
MSérror 
Error 22009 k(n = 1) —2 | Мб. 
Total Sj, kn —3 


Кызы SEE ee у. ___ 


experimental data to obtain the adjusted error, the degrees of freedom for 
the latter source of variation are k(n — 1) — 2. 4 

To compute the variations and covariations, the procedures in parts i 
through iv of Table 11.3-1 may be extended to cover any number of covari- 


ates. For example, 
E}, = (22) — (32), E,, = (2xz) — (3xz), 
Dy, = (32) = (12), TEES (GET) (1xz). 
In order to obtain the adjusted criterion variations, the pooled within-class 


as well as the over-all regression coefficients are required, Given the varia- 


tions and Covariations, the Tegression coefficients are obtained from the 
relations given in this section. 


The adjusted mean for treatment j is 
Ү = Y; — , (X, X) — b, Z, — 7). 
The average effective error variance per experimental unit in making com- 
parisons among adjusted treatment means is 


yan «ы + ТЕ — 2ТЕ. + mee], 
(k — 1XE,E,, — Ez) 
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where 5. is the mean square for error in the analysis of covariance. The 
average effective error variance for the difference between two adjusted 
treatment means is 


2 
52 —, 28у 
YY n 


The effective error variance for the difference between two adjusted treat- 
ment means for a test in the a priori category is 


sd, [2 (X, — Е ОИЕ ee 2уғ.! 
E. у 


п ЕЕ. — Ej, 
When the treatments affect the covariate, the last expression for the error 
variance is the more appropriate for use in testing differences between 
adjusted treatment means. : 
For the case of p covariates, expressions for the regression coeflicients are 
most easily written in matrix notation. Suppose that the criterion is des- 
ignated X, and the covariates Xy, Х,..., X,. Let 


eo = [Eu Ен t Еһ], 5 = [Son 5ш ``‘ 8], 
E Ey сс Ey Su Sig 7 5] 
Еһ Еһ `` Ер Su Ss cc Sap 
Bee Ë | SS 3 ІІ 
| Ел Еа” Ep | 5м S» '' Soo 
b= [bor Буз c7 bowl, b,—[b Воз c bos] 


In terms of these matrices, the regression coefficients for the within-class 
data are 


b—eE 
The regression coefficients for the over-all data are 
b, = sS t. 


The adjusted error and treatment variations are as follows: 
Ey = E, — һер, Ty = 8,- b,So — Ew: 


APPENDIX A 


Topics Closely Related to 
the Analysis of Variance 


АЛ Kruskal-Wallis H Test 


Analogous to a single classification analysis of variance in which there 
are no repeated measures is the analysis of variance for ranked data. Sup- 
pose that there are k treatment classes having п, observations in each class. 
Suppose further that the observations are in the form of ranks. That is, 
the criterion scores are ranks assigned irrespective of the treatment class 
to which an observation belongs. The data given below illustrate what 
1s meant: 


Treatment 1 Treatment 2 Treatment 3 
1 3 6 
2 5 9 
4 8 12 
1 10 13 


1 14 


To test the hypothesis that the ranks within the treatment classes are a ran- 
dom sample from a common population of ranks, the following statistic 
may be used: 

ған SSireat 


MSiotai ' 


Numerator and denominator of the H statistic have the usual analysis-of- 
variance definitions. 


When the hypothesis being tested is true,and when each n; islarger than 5, 


п; and К, special tables for the H statistic are available. Computational 
procedures for this test duplicate the procedures for a single classification 
analysis of variance. The latter procedures correct for tied ranks, whereas 
the specialized formulas for the H statistic require corrections for tied ranks, 
if these occur, 
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The Mann-Whitney U statistic is closely related to the H statistic when 
к = 2. Extensive tables for the U statistic for small л; are available [see 
Siegel (1956, pp. 271-277)]. Individual comparisons between two treat- 
ments following an over-all H test may be made by means of the U statistic. 
An application of this procedure will be found in Lewis and Cotton (1958). 
If one of the treatments represents a control group, the nonparametric 
analogue of the Dunnett procedure is described in Sec. A.3 of this appendix. 

A different approach for handling data which are in terms of ranks is to 
transform the ranks into normalized scores. Tables for making this 
transformation are given in Walker and Lev (1953, p. 480). In the latter 
form the data may be handled by means of the usual analysis of variance. 
The latter approach may lead to somewhat different conclusions. If the 
population to which inferences are to be made is considered to be one in 
which the criterion scores are normally distributed, then the analysis of vari- 
ance in terms of the transformed scores is the more appropriate. On the 
other hand, if inferences are limited to ordinal measurement on the cri- 
terion scale, then the Kruskal-Wallis H statistic provides the more appro- 


priate type of analysis. 


A.2 Contingency Table with Repeated Measures 


Consider an experiment in which n judges are asked to assign ranks to r 
products, Data obtained from this experiment may be summarized as 


follows: 


cun a se 
Rank 
Product х Total 
1 2 J r 
1 nu тіз пу Mir n 
2 LENI Neg лә Nor ue 
i па Nig т Nir te 
r па п ng hrr x 
nr 
лы [n T IM. 


resents the number of times product i receives а 
rank of j. (The sampling distributions to be discussed in this section are 
obtained by limiting procedures which assume л to be large. The approxi- 
mations have been shown to be reasonably close for n = 30 and larger. 
In similar limiting procedures, л = 5 and larger provide adequate approxi- 


mations.) 


In this summary, 71;; гер! 
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To test the hypothesis of no differences between the products with respect 
to the frequency with which the products receive the rank of J; the statistic 


r> [n — (n/r) (rXnj) — п? 


n n 


9; 


may be used. When there are no differences between the frequencies in 
column j, except those due to sampling error, О, has a sampling distribu- 
tion which is approximated by a chi-square distribution having r — 1 
degrees of freedom. 

To test the over-all hypothesis of no differences between the ranks for 
the products, the statistic 


= EQ, 
may be used. Under the hypothesis of no difference between the ranks 
assigned to the products, Anderson (1959) has shown that the statistic 
(r — )o* 


LH — ** — withdf = (r — 1), 
% 


If the statistic (ғ — 1)Q?/r exceeds the critical value for a test having 
level of significance «, as determined from the appropriate sampling dis- 
tribution, the hypothesis of no difference between the frequencies within 
the columns is rejected. 

Anderson (1959) has shown that the Friedman statistic discussed in 
Sec. 4.7 provides a test on r — 1 components of the over-all chi square. 
The latter may be partitioned into individual comparisons, or contrasts, 
each having a single degree of freedom. In making tests on such compari- 
Sons, one uses the following estimates of the variances and covariances for 
the cell frequencies: | 
n(r — 1) 


var (n,;) = mo 
E 


—n 
Cov (пп) = vx | 
r 


where n,; and л, are two frequencies in the same row, 


—n 
соу (п, п.) = =e 
H 


where л, and ль are two frequencies in the same column, and 


n 


cov (1) = re-i > 


where n, and Mym are two frequencies in different rows and columns. 
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A 3 x 3 contingency table will be used for illustrative purposes. The 
cell frequencies are the column headings. 


Пі Nye "3 Nay na Meg Tay Jig Па 
с, 0 0 0 =] 0 | 0 0 0 
с, 1 0 0 0 0 0 --1 0 0 
Cs 1 0 zu 0 0 0 -i 0 1 


The coefficients in row C, represent a linear comparison among the ranks 
assigned to product 2. The numerical value of the chi-square statistic 
corresponding to this comparison is 


(па — na 


сены 
X0, — 3 
var (na — ny) 


The denominator of this statistic is 


var (ny — ng) = var (ng) + Var (ль) — 2 COV (пуп) 


n(r—1) , тг-1) 2-п) 
т? ss n y» 


The above chi-square statistic has one degree of freedom. Should this 
statistic exceed the critical value for an a-level test, the data would indi- 
cate a statistically significant difference between the rankings assigned to 
product 2. | , 

The coefficients in row С, represent a linear comparison among the 
products for rank 1. (This comparison is not orthogonal to C,.) 
The chi-square statistic corresponding to this comparison is 


2 
2 О = Ma) (6 — т) 
Zos = var (ny — Пу) 2n|r 


2 
_ (пу = na) 
2n 


This chi-square statistic has one degree of freedom. б 
The coefficients in row Сз represent a comparison between the differences 


in linear rankings for products 1 and 3. ~ (Comparison C; is orthogonal to 
comparison С.) The chi-square statistic corresponding to this com- 
parison is і 

(ny — Mg + "а — па) 


yar (fı — Mg + Па — Mai) 


Xos = 
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Table A.2-1 Numerical Example 


Product 
Judge Total 
a b с 
1 2 1 3 6=P, 
2 1 2 3 6 
3 1 3 2 6 
4 1 2 3 6 
5 2 1 3 6 
6 1 2 3 6 
7 1 3 2 6 
8 1 2 3 6 
(i) T, = 10 16 22 48 
G*/nr = (48°)/8(3) = 96.00 (ET5)n = 105.00 
xx? =112 (EP3)r = 96.00 
SSproducts = 105 — 96.00 = 9.00 
SSw, шіге = 112 - 96.00 = 16.00 М5, juages = 16.00/16 = 1.00 
SSproducts 
2 3 produc! = 9,00 
ODE MSIE 
m————— oue TREE 
Rank 
Product Total 
1 2 3 
а 6 2 0 8 
í b 2 4 2 8 
(ii) с 0 э) 6 8 
8 8 8 
20, = 30 — 2)?/2(8) =0 


X =26 —0 +6 — 0)/4(8) = 9 


Xê, +28, =9 
TW. 9000 1 f i a NM MN 
The individual variances and covariances required to obtain the term in the 

denominator are given by 


var (n — ng + N33 — ng) = 4 var (nj) — 4 cov (n, na) 


— 4 соу (ль) + 4 cov (Pijem) 


TENA 
r—1° 
Thus, = (r — Dy — ny + Пзз — пз)? 
3 IDEE p зыны. ДМ: B 
4n 


The chi-square statistic used in the Friedman test is equivalent to the 
sum of r— 1 orthogonal comparisons among the products. For the 
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case of a 3 x 3 contingency table, C, and C, are orthogonal comparisons 
ofthis kind. Hence, 


2 SS, 
Xe, aig Ler = ав = а. > 
where the last term on the right is the statistic used in the Friedman test. 

The numerical example given in Table A.2-1 illustrates this last relation- 
ship. Basic data are given in parti. There are n = 8 judges and r = 3 
products. The rankings assigned by each judge are shown. The varia- 
tions obtained in part i are defined as follows: 
xm oc 

nr 
v p? 
55у. juage = =x? x T . 

Computation of this latter source of variation may be simplified when 
по tied ranks are permitted. The critical value for a .05-level test is 
750) = 6.00. Hence the test in part i indicates that the differences in 
ranks assigned to the products are statistically significant. Р 

Data from part i are rearranged to form a contingency table in part 
ii. The comparison C;, which was defined earlier in this section, indicates 
no difference іп the linear ranking for product b. The critical value associ- 
ated with a .05-level test on C; is у%(1). = 3.8. Hence the data indicate 
that there is а statistically significant difference between the linear rankings 


for products а and с. Note that 
ё, + Xe, = Hanks. 
A.3 Comparing Treatment Effects with a Control 


Procedures for comparing all treatments with a control were discussed 
in Sec. 3.10. A nonparametric analogue of these procedures has been 
developed by Steel (1959). A numerical example will be used to illustrate 
the procedures for comparing all treatments with a control when data are in 
terms of ranks. In a sense, these comparisons are part of the over-all 
hypothesis tested by the Kruskal-Wallis H statistic. | || . 

The basic data for this numerical example are given in part 1 of Table 
A.3-l. Suppose that only the rank order of these measurements is con- 
sidered meaningful. The data in part ii are in terms ofranks. To obtain 
these ranks, the control scores and the treatment a scores are combined; 
then ranks | to 2n are assigned to the combined set of scores. In case of 
ties, the mean of the tied ranks is used. The combined sets of scores for the 


control and treatment a groups are as follows: 
35 40 | 45 45 48 50 


SSproaucts Fý 


Scores... 


Ranks... 1 2 | 35 | 35 5 
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Data from the control condition are underscored. The combined sets of 
scores from the control and treatment c conditions are as follows: 
45 50 60 62 | 75 | 75 | 78 | 80 | 80 


| 53 | 55 | 7 | 8.5 | 8.5 


Scores... 84 


Ranks... 1 


2 


adhag 


10 


The sum of the ranks for the control group and each of the treatment 
groups is then computed. Т) represents the sum of ranks for the control 
condition when the scores are ranked with reference to treatment a. The 


Table A.3-1 Numerical Example 


Control Treatment a Treatment b Treatment с 
45 35 58 75 
50 40 62 78 
() 60 n=5 45 70 80 
62 48 78 80 
75 50 80 84 
IIn eM 
Control Treatmenta Treatment b Treatment с 
a b c 
3.5 1 1 1 3 5.5 
6.5 2 2 2 6 7 
(ii) 8 4 3 3.5 8 8.5 
9 5 4 5 9 8.5 
10 7 5.5 6.5 10 10: 
SOR лоо 155 18.0 36.0 39.5 
Le Т) T d Т, Т» 


= a ————— | 


test statistic used in the decision rule about the difference between treatment 
a and the control condition is min (77,7,), that is, the smaller of 7; and Ty. 
Іп this case, 
min (7,7) = min (37,18) = 18. 
As a partial check on the numerical work, 
Т; + T; = n(2n + 1). 

Steel (1959) has constructed tables of the sampling distribution of the 
Statistic min (7;,Т). Probabilities in these tables are in terms of an experi- 
mentwise errorrate. By definition, the latter is the ratio of the number of 
experiments with one or more false significance statements to the total 
number of experiments. For the case n= 5 and k = 3, where К is 
the number of treatments (excluding the control), the critical value for the 
rank sum statistic for a two-tailed test with error rate .05 is 16. The de- 
cision is made to reject the hypothesis of no difference between treatment / 
and the control if 


min (T/,T;) < 16. 
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For the data in Table A.3-1, treatment с is statistically different from the 
control, but none of the other differences between the treatments and the 
control is statistically significant, with a .05-level experimentwise error rate. 
Had the direction of the differences between the control and the experi- 
mental groups been predicted prior to the experiment, one-tailed rather than 
two-tailed tests would be appropriate. The critical value for a .05-level 
one-tailed test in which n = 5 and К = 3 is 18. 


A.4 General Partition of Degrees of Freedom in a Contingency Table 


To illustrate the procedures to be discussed in this section, consider the 
three-dimensional contingency table having the following form (all observa- 
tions are assumed to be independent): 


e Са 

by ba b, by 
41 тш M21 Miz Пір? 
az п Пәзі Noe Nove 
аз пзп "зэ "зз "322 


In general there will Бер classes for category 4,4 classes for category B, and 
r classes for category C. The frequency in cell abc; will be designated by 


the symbol лу. 
Т ер above contingency table is disregarded, the result- 


If the B category in the у 
ing AC summary table will have the following form. 
e сә Total 
41 та та n. 
а; Па Таз Ng, 
a3 тлі LE пз, 
n.a n.a п... 


In general the following notation will be used: 


Уп = nage Уп = т» Ул = N35 
i 7 
Ута ie Ут = Nis Xx" = n. =т.;.; 
БУР = Уту. = Ут. = п... 
tak 43 “ 


ing i i i — | parameters 
If sampling is random with respect to all categories, par 1 

are ee, to specify the population from which the sample of size n... 
was drawn. These parameters may be specified in terms of the following 


proportions: 


Pik = proportion of population frequency in cell аБс;з- 
ijk 
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The expected frequency in cell ась, which will be desi gnated by the symbol 
Tig, 18 
Nise = Pig. 
The expected frequencies for the marginal totals of category A would be 
п. = Ушу; 
jk 


alternatively, 


The symbol Р, designates the population proportion for the category a, 
The other expected marginal frequencies are 


n'y — Рп, 
nj = Pan. 
The expected frequency for a cell in the AB summary table is given by 
Mis, = Утул 
n 


x XP. = Ppi 
k 


Other expected frequencies for two-way summary tables are 
nix = Pign., 


, 
Da = Р уп. 


If all of the pgr — 1 parameters in the population are specified by ana 
priori model, and if the sampling is random with respect to all categories, 
then the total chi square indicated in Table A.4-1 may be partitioned in the 
manner shown in this table. This Partition bears a marked resemblance to 
an analysis-of-variance table, 

Tests with respect to conformity with the specified model may be made, 
provided that the sampling distributions for the statistics indicated may be 
approximated by chi-square distributions. If each of the expected cell 
frequencies is greater than 5, the chi-square distributions will provide good 
approximations. Ifa relatively small number of expected frequencies are 
less than 5, the chi-square approximations will still be good. 


however, certain of the parameters in the model are often estimated from the 
observed data. For example, the parameters P, , Р;, and P , may be 
estimated from the marginal frequencies of the sample data. Under the 
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hypothesis of no interactions of any order (i.e., no two-category or no three- 
category interactions), the expected proportion for cell абс is 

Pig, = Pi PsP 
and the expected frequency in cell abc,;, is 
тк = Pin... 
Under this method for specifying the model for the population, the total 


Table A.4-1 Partition of Chi Square 


Source Chi square df 

Total hu = Хь — пр) т] par = і 

А xè = Ут. — ni Ini p-1i 

B № = Уп. — n^ Inl 4-1 

с x = Уп. — тты p 

AB xh = EX, - туп] — x — xb (p = D(q = 1) 
AC wey = XXn — nalna] — xa — He (p = Yr = 1) 
BC Kee = TEs — пты — x — 2 (4—1) —1) 
ABC | dh. = 22a = 48 — 2h — 23 о 1| (р-04- I(r = 1) 


chi square may be partitioned as shown in Table A.4-2. In this case, note 
that the degrees of freedom for the total chi square are 


(par — 1) = @—D=—@—-Y—-C-Y)= par Pa +2. 


Should the three-factor interaction Бе statistically significant in this type of 
analysis, the two-way summary tables should be studied separately within 


Table A.4-2 Partition of Chi Square When Probabilities are 
Estimated from Marginal Totals 


Source Chi square df 


Total |2 = Хук — ИТТЕ pgr- D —(p-1)-@-1)-@- 1) 


AB уь = ХУЦ, — nis [nil (p = 09 = 0) 
АС Xe = ЭО — niania] (p-1Xr-1) 
BC xh = Ea — nading (q = Wr —1) 


2, — he — Kare (р = 1004 — Dr = 0) 


АВС | ж = Дош — Ха 


a fixed level of the third category. In these latter tables, the marginal totals 
may be used in some cases to estimate the cell frequencies. For example, if 
the AB data for level c; are being studied, under the hypothesis of no inter- 
action between categories A and В for level сі, 

rasan 


тз = 
ij na 
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This expected value for cell ijl will not in general be the same as that ob- 
tained under the hypothesis of no interactions of any order. 

Another case which arises in practice is one in which the sampling is 
restricted with respect to the number of observations in each of the cells of 
the form ab;; but random with respect to the category C. If the marginal 
totals are used in the estimation of P. ,, then 


Under the hypothesis of no interactions, 
Nin = РР уп. 


In this case, chi square may be partitioned in the following manner: 


re ek 
Source df 
Total (pq — Yr — 1) 
AC (p — 1)(r — 1) 
BC (q — Dr — 1) 
ABC (p — Dq — Dr — 1) 


Should the three-factor interaction prove to be statistically significant in this 
case, it is advisable to study the equivalent of simple effects for category C 
at each of the separate levels of factors А and В. 


А.5 Hotelling's T? Test for the Equality of k Means 


In the chapters dealing with repeated measures, implicit in the final F 
tests on within-subject effects was the assumption that the variance-co- 
variance matrix had symmetry of the following form: 


0% ро... pot] 

po? o? pie po? 
а) Does nie pnus : 

о Mos 


Over-all tests on within-subject effects can be made through use of Hotel- 
ling’s Т? statistic without the assumption of symmetry of the above form. 
The only assumptions underlying the latter tests are that the variables have 
a multivariate normal distribution with variance-covariance matrix X, 
where the general form of X. is arbitrary. 
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For a single-classification repeated-measure experiment in which there 
are k treatments and n subjects, Hotelling's 7* has the form 


T? = nx'S^!x, 


71-6 
T, —G 
where xe T 
T,— б. 


and S7 is the inverse of the variance-covariance matrix. The hypothesis 
that o? = біз rejected if 
ты > OOF p, n 0) 
To illustrate the computation of Hotelling's Т? statistic, suppose that the 
following data have been obtained in an experiment in which n = 15 and 


k =3: 


T=10, Ту-М, T%=18, G=14; 


100,00 40.00 0.00 
S= | 40.00 50.00 40.00}, Var — cov = 40.00. 


0.00 40.00 50.00 


From these data on the treatment means, 
SStreat = nX(T; — Gy. = 480.00, 


MStreat = 4000 = 240.00. 


Hence the analysis of variance for the within-subject effects is as follows: 


Source df MS F 
Within subjects 30 
Treatments 2 240.00 6.00 
Residual 28 40.00 


The critical value for a Ol-level test, under the assumption that X has the 


symmetry indicated by (1), is 
F p[k — 1 (n — Ik — DI = F gg(2,28) = 5.45. 


By using the conservative test suggested by Box, the approximate critical 
try assumption on Z,is 


vàlue for a .01-level test, without the symme 
Fal, n — 1) = Fa(1,14) = 8.86. 
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Hotelling's 7? statistic is given by 


090 —.200  .160][—4.00 
Tors = 15[—4.00 0.00 4.00]| —.200 500 — 400 sm 
160 —.400 .500}| 4.00 

= 15(2.08) = 31.20. 


The matrix 5-1 is obtained from the matrix S by the Dwyer square-root 
method or by the Doolittle method. 
The critical value for a .01-level test using the 7° statistic is 


a4) F (3,12) = 22.31. 
12 
In this case the Т? test leads to the rejection of the hypothesis that o? = 0, 
whereas the approximate test does not lead to rejection of this hypothesis. 


Table A.5-1 Numerical Example (Repeated Measures) 
(k = 4, п = 7) 


Observed means: 7, = 16, 7; = 22, T, = 22, Т, —20, 6 = 20 
Estimate of X (assuming symmetry) obtained from observed data: 


3.00 1.80 1.80 1.80 
ғ-| 180 3.00 1.80 1.80 
г 150 1.80 3.00 1.80 
К 1.80 1.80 1.80 3.00, 
(i) 
(54 — —178 | —178  —.178 
$a. | —.178 54 -Л78 -4178 
178 -178 1654. 22-178 
S78 178. --178 654. 
ХОТ, -С% 7-4} 2 2 41 02 
MS iat = 20 Gy 110—4) + 22 + 22 + 0% 
—1 3 
= 56.00 
Gi) М8, = var — cov = 3.00 — 1.80 = 1.20 
MStreat 56.00 
л оне әз. | 
Ми Боо т 07 
(iii) ЕТ Ge 17572 oy’ = 140.0 


Instead of a single-classification experiment, suppose that one had the 
factorial experiment: 


by b, ауы b, 


ат б, б, с, 
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In using the T? statistic for testing effects associated with factor B, the 
following homogeneity assumptions are required : 


X = Dag = = ag = 


In words, the variance-covariance matrices for each level of factor A must 
be equal to X. 

When the matrix = has the symmetry indicated in equation (1), it is 
relatively easy to show numerically that Hotelling’s T? is proportional to the 
usual F statistic in the analysis of variance. Тһе data in Table A.5-1 will be 
used for this purpose. 

In part i the observed treatment means and an estimate, S, of the matrix 
X, assuming symmetry of the type indicated in equation (1), are given. 
From Sone computes 5-1. The latter has the same kind of symmetry that 
Shas. The F ratio that would be obtained from the usual analysis of 
variance is computed in part ii. The T? statistic is computed in part iii. 
It is noted that 

Тў» = (k = І)Ғов 
= (4 — 146.67) = 140.0. 


A.6 Least-squares Estimators—General Principles 


A simple example will be used to illustrate the least-squares principle in 
estimation. These principles will then be stated in general form. Suppose 
that the observations ху, хь, and Xs have the following structural form: 


ху = fy & 
(1) х= В + 28, + е» 

xs = Bo + Es- 
In (1), Ё, and f, are unknown parameters; however, the coefficients of f, 
and fl, are assumed to be known. The variates e, £2 and еҙ are assumed y 
be independently and normally distributed, with expected value zero a 
constant variance o2. Тһе problem is to find linear estimates of Ву, Bay an 
Вз such that the estimate of о? is minimized. Let bı, ba, and bs be estimates 


of the corresponding fs. lex е 
To obtain these estimates, one minimizes the expression 


(2) [671 E ы» -(ә- by — 2, + 0з — by)”. 


Differentiating (2) with respect to bı and b,, setting the resulting е шы 
equal to zero, and then rearranging the terms yields the following norma 


equations: 


2b, + 2b; = Х1 + Xe, 
e 2b, + 5b = 2X_ + Ха. 
andb, Itis noted that (3) is linear 
If further the set (3) is linearly 
y obtain unique values for the 


The problem now is to solve (3) for by 
in the x’s as well as linear 1n the P's. 
independent (in this case they are), then onema 
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b's as linear functions of the x’s, (A set of equations is linearly independent 
if no equation in the set is a multiple of any other or a weighted sum of 
two or more of the other equations.) 
Solving (3), one obtains 
(4) b, = $(5x, + x, — 2x3), 
b = — $ (x; — xy — ху). 

Thus 5, and b, are linear functions of the x's (that is, a weighted sum of the 
X's) The variance of any linear function of the x's is a linear function of the 
corresponding variances. In this case, 

var (by) = de[(5)* + (1)? + (—2)"]o;, 

var (ba) = [(1)# + (—1)? + (— 1) Jor. 
The coefficient of оў is the sum of the squares of the coefficients of the x’s 
in the Corresponding /78. The variance of each of the x’s is assumed to be 
9;. The smaller the variance of the D's, the "better" the estimate. The 


best linear unbiased estimator of a bis the one having the smallest variance. 
In this case, 


var (bj) = #902 = 502 
var (b) = #02 = ig? 
The least-squares estimators b, and b; can be shown to be the best linear 
unbiased estimators of В апа By. The proof of the general case is givenina 
basic theorem in statistics—the Markoff theorem. By way of contrast, let 
bi = x — 2x. 

From (1), X — 2x = By + 28, + 6, — 28, — 26, 

= By + eg — 25. 
Hence b; is a linear unbiased estimate of. В. The variance of bj is 

var (bj) = [(1* + (—2)"]o? = 502 

The Corresponding least-squares estimate b, has variance $02, 


_ Toillustrate (1) through (4) numerically, suppose that one has the follow- 
Ing experimentally determined values for the x's: 


Xi = 6, Xp = 0, Ха = —6. 

From (4) the least-squares estimators are 

bı = 8[5(6) + (0) — 2(—6)] 7, 

ba = —4[(6) — (0) (—6)] 4. 
Substituting these values for the x's and the b’s in (2), one obtains an 
estimate of the sum of Squares due to the г”. 

SSerror = (6 — 7)? + 0 —7 + 8)? + ( 6+ 4)? 
= 6. 
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An algebraically equivalent method for computing SS,,,,, is given by 
(5) SSerror = Ux? — bf, — bafo 


where f, is the linear function of the x’s on the right-hand side of the first 
equation in (3) and / is the corresponding linear function in the second 
equation in (3), In this case 


SSerror = [(6)? + (0)? + (—6)*] — [(7)(6 + 0)] — [(—4)(2 - 0 — 6)] 
=72—42—24=6. 


For the general case, suppose that the set (1а) consists of equations of the 
following form, 
xy = Ay Py + а +5 + anpe + е, 
(a) о И: А 
Xn = а.в + а + 5 ав + Ens 


where n> k. The {7з are unknown parameters, the a’s are known con- 
stants, and the e’s are independently and normally distributed, with mean 
zero and constant variance o7. 

The normal equations will have the general form 


2102... ..р) = ЛО. Xn) 
Qa) ЕИ 3 
КОЛ 99) = fK 20) 


where the g's and f’s denote linear functions. These equations may or may 
not be linearly independent. If they are not, a set of linear restrictions may 
be placed on the b’s so as to yield a unique set of values. These restrictions 
generally have a form which tends to simplify the solution of the кшш 
equations. Although these side conditions do not influence tests on dif- 
ferences between effects or contrasts in general, they do influence the esti- 
mates of the effects per se. Hence such side conditions should not be 
arbitrary in form; rather, they should reflect the structure of the population 
about which inferences are to be drawn, if the estimates per se are ma 
for descriptive purposes. The side conditions are generally of the form 


шь + шә» + + usb, = 0, 


i i iti i 245; 
SUUS Me uds Bri for the b’s, one obtains 
expressions of the following form: 
b, = uh + Шаш oh Cds 
(4a), а MET E. 
b, = сал + ON ma: Г са ы 
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That is, the b’s are linear functions of the в; the latter in turn are linear 
functions of the x’s. Hence the general form of b; is 


b; = dax баж + d; 


im 


Неге b; is a linear unbiased estimate of B; Further, 
var (bs) = (dj, + d$, +--+ + a2, )o2 
Of all linear unbiased estimators of 8» the least-squares estimator b, can be 
shown to have minimum variance. Hence b; is the best linear unbiased 
estimator which satisfies the set (3a) and the side conditions (if any). 
An estimate of o? is given by 

SSerror _ Ux? — ХЫ 

dferror dferror 
Where dferror = n — (number of linearly independent parameters estimated), 


ae 


, 
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Tables 


B.1 
В.2 
В.3 
В.4 
В.5 
В.6 


В.7 
В.8 
В.9 
В.10 
B.11 
B.12 


Unit normal distribution 

Student's f distribution 

F distribution 

Distribution of the studentized range statistic 

Arcsin transformation 

Distribution of / statistic in comparing treatment means with a 
control 

Distribution of Fmax Statistic 

Critical values for Cochran's test for homogeneity of variance 
Chi-square distribution 

Coefficients of orthogonal polynomials 

Curves of constant power for tests on main effects 

Random permutations of 16 numbers 
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Table В.1 Unit Normal Distribution* 


[P@ € 21_,) =1 —a] 
eee SE EE GNE 
1—& Ap ELT ЖЕ ПЕ 77 Bia 
50 0.00 75 0.67 950 1.645 
51 0.03 76 0.71 955 1.695 
52 0.05 77 0.74 960 1.751 
53 0.08 78 0.77 965 1.812 
54 0.10 79 0.81 970 1.881 
55 0.13 80 0.84 .975 1.960 
.56 0.15 :81 0.88 .980 2.054 
57 0.18 82 0.92 .985 2.170 
:58 0.20 83 0.95 990 2.326 
59 0.23 84 0.99 | .995 2.576 
.60 0.25 .85 1.04 .996 2.652 
:61 0.28 .86 1.08 .997 2.748 
.62 0.30 :87 1.13 .998 2.878 
.63 0.33 .88 1.17 .999 3.090 
64 0.36 89 1:23 
:65 0.39 -90 1.28 9995 3.291 
.66 0.41 91 1.34 99995 3.891 
.67 0.44 :92 1.41 
.68 0.47 593: 1.48 .999995 4.417 
69 0.50 94 1.55 
:9999995 5.327 
70 0.52 
71 0.55 
72 0.58 
273 0.61 
74 0.64 
ee 
N(0,1) 
га 
« 
5 


* This table is abridged from Table 9 in Biometrika Tables for Statisticians, vol. 1. 
(2d ed.) New York: Cambridge, 1958. Edited by E. S. Pearson and H. O. Hartley. 
Reproduced with the kind permission of E. S. Pearson and the trustees of Biometrika. 
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Table В.2 Student's г Distribution 


LLL oh a lS 


Percentile point 

df 
70 80 90 95 97.5 99 99.5 
1 73 1.38 3.08 6.31 12.71 31.82 63.66 
2 62 1.06 1.89 2.92 4.30 6.96 9.92 
3 58 98 1.64 2.35 3.18 4.54 5.84 
4 Sk 94 1.53 2.13 2.78 as 4.60 
5 56 92 1.48 2.01 2.57 3.36 4.03 
6 55 91 1.44 1.94 2.45 3.14 3.71 
7 55 90 1.42 1.90 2.36 3.00 3.50 
8 55 89 1.40 1.86 2.31 2.90 3.36 
9 54 88 1.38 1.83 2.26 2.82 3.25 
10 54 88 1.37 1.81 2.23 276 3.17 
П 54 88 1.36 1.80 2.20 2.72 3.11 
12 54 87 1.36 1.78 2.18 2.68 3.06 
13 54 87 1.35 1.77 2.16 2.65 3.01 
14 54 87 1.34 1.76 2.14 2.62 2.98 
15 54 87 1.34 1.75 2.13 2.60 2.95 
16 54 86 1.34 1.75 2.12 2.58 2.92 
17 53 86 1.33 1.74 2.11 2.57 2.90 
18 53 86 1.33 1.73 2.10 2.55 2.88 
19 53 86 1.33 1.73 2.09 2.54 2.86 
20 53 86 1.32 1.72 2.09 2.53 2.84 
21 53 86 1.32 1,72 2.08 2.52 2.83 
22 53 86 1.32 1.72 2.07 2.51 2.82 
23 53 86 1.32 1,71 2.07 2.50 2.81 
24 53 86 1.32 1.71 2.06 2.49 2.80 
25 53 .86 1.32 1.71 2.06 2.48 2.79 
26 53 .86 1.32 1.71 2.06 2.48 2.78 
27 53 .86 1.31 1.70 2.05 241 2.77 
28 393 86 1.31 1.70 2.05 2.47 2.76 
29 .53 85 1.31 1.70 2.04 2.46 2.76 
30 53 .85 1.31 1.70 2.04 2.46 2.15 
40 53 .85 1.30 1.68 2.02 2.42 2.70 
50 .53 85 1.30 1.67 2.01 2.40 2.68 
60 53 85 1.30 1.67 2.00 2.39 2.66 
80 153 85 1.29 1.66 1.99 2.37 2.64 
100 53 .84 1.29 1.66 1.9 2.36 2.63 
200 52 84 1.29 1,65 1.97 2.34 2.60 
500 52 84 1.28 1.65 1.96 2.33 2.59 
© УА 84 1.28 1.64 1.96 2.33 2.58 
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Table B.3 


df for numerator 


e 
B 
5 
8 
R 


denom. 1 2 3 4 5 6 7 8 9 10 11 12 


‚75 5.83 7.50 820 858 882 898 9.10 9.19 926 9.32 936 9.41 
1 90 | 39.9 49.5 53.6 55.8 572 582 58.9 59.4 59.9 60.2 60.5 60.7 
195 161 200 26 225 230 234 237 239 241 22 243 244 


.75 2.57 3.00 3.15 3.23 3.28 3.31 3.34 3.35 3.37 3.38 3.39 3.39 
2 .90 8.53 9.00 916 9.24 9.29 933 935 9.37 9.38 9,39 9.40 941 
95 [18.5 190 192 192 193 193 n ы 

99 |98.5 990 992 99.2 993 99.3 99.4 99.4 99.4 994 99.4 994 


.75 202 228 2.36 239 241 242 243 244 244 244 245 245 
3 90 5.54 5.46 5.39 534 531 528 527 5.25 524 523 522 522 
95 |101 9.55 928 912 910 8.94 8.89 8.85 881 8.79 8.76 8.74 
9 |34.1 30.8 29.5 287 282 279 Zio 278 2700 271 271 


75 1.81 2.00 2.05 2.06 2.07 2.08 2.08 2.08 2.08 2.08 2.08 2.08 
4 90 454 432 419 411 405 401 3.98 
.95 7.71 694 6.59 639 6.26 616 609 
99 |212 180 167 160 155 15.2 15.0 


Raw 
оо о 
RU 
Faw 
AS 
e 
Еле 
nies 
8 
Ru. 
+ © 
кош 
+ № 
5 


75 169 185 188 189 189 1.89 189 189 1.89 1.89 1.89 1.89 
5 .90 406 3.78 3.62 3.52 3.45 340 3.37 3.34 3.32 3.30 328 327 
95 661 5.79 541 519 5.05 4.95 4.88 4.82 4.77 4.74 4.71 4.68 
39 |163 133 121 114 110 107 10.5 103 102 10.1 9.96 9.89 
‚75 162 1.76 1.78 1.79 1.79 1.78 178. UTASAIT 27 1.77 197 
6 .90 3.78 3.46 329 318 3.11 305 3.0 2.98 2.96 2.94 2.92 2.90 
.95 5.99 5.14 4.76 4.53 439 4.28 421 415 410 4.06 4.03 4.00 
99 |137 109 9.78 915 8.75 8.47 8.26 810 7.98 7.87 7.79 7.72 


‚75 157 1.70 172 172 171 171 1.70 1.70 1.69 1.69 1.69 1.68 


.95 5.59 474 4.35 4.12 3.97 3.87 3.79 3.73 3.68 3.64 3.60 3.57 
99 |122 955 845 7.85 7.46 7.19 6.99 684 6.72 662 6.54 6.47 


‚75 154 1.66 167 166 1.66 1.65 164 164 1.64 1.63 1.63 1.62 


95 4.46 407 3.84 3.69 3.58 3.50 344 3.39 3.35 3.31 3.28 
99 |1,3 865 7.59 701 6.63 6.37 6.18 6.03 591 5.81 5.73 5.67 


‚75 


1.51 162 1.63 163 162 161 1.60 1.60 1.59 1.59 1.58 1.58 

9 -90 3.36 301 2.81 2.69 261 2.55 251 247 244 2.42 2.40 2.38 
95 5.12 426 3.86 3.63 348 3.37 329 323 3.18 3.14 3.10 3.07 

.99 |106 802 699 642 6.06 5.80 5.61 5.47 5.35 5.26 5,18 5.11 

„75 149 1.60 1.60 1.59 1,59 1.58 1.57 1.56 1.56 1.55 1.55 1.54 

10 -90 3.28 292 2.73 2.61 2.52 246 241 238 235 232 230 228 
195 496 410 3.71 3.48 333 322 3.14 307 302 298 294 2.91 

99 |100 7.56 6.55 5.99 5.64 5.39 5.20 5.06 4.94 4.85 4.77 4.71 


:75 1.47 1.58 1.58 1,57 1.56 1.55 1.54 1.53 1.53 1.52 1.52 1.51 


СӘЛ 04:04 25.08 E 8090 306 205 22) 2.85 2.82 2.79 
99 | 965 721 622 567 532 5.07 4.89 4.4 4.63 4.54 4.46 4.40 


515 1.46 1.56 1.56 1.55 1.54 1.53 
12 .90 3.18 2.81 2.61 24% 2.39 233 
.95 4.755 3.8 3.49 326 31 1 3.00 
299 9.33 6.93 5,95 541 5.06 4.82 


TABLES 


F Distribution* 


— нн тте ация тет клан ен 


df for numerator 


1—@| df for 
15 20 24 30 40 50 60 10 120 200 50 œ denom. 
9.49 9.58 963 9.67 9.71 9.74 9.76 9.78 9.80 9.82 9.84 9.85| 75 
612 61.7 62.0 623 62.5 627 628 630 631 632 633 633 | 90 | 1 
246 248 249 250 251 252 252 253 253 254 254 254 | 95 
341 343 343 344 345 345 346 347 347 348 348 348| 75 
942 944 945 946 947 947 947 948 948 949 949 949| 90 | 2 
194 194 19.5 19.5 195 19.5 195 195 195 195 195 19.5 | 95 
99.4 99.4 99.5 99.5 99.5 99.5 995 99,5 995 99,5 995 99,5 | 99 
246 246 246 247 247 247 247 247 241 241 247 241| л5 
520 518 518 517 516 515 515 514 514 514 514 513| 90 | 3 
8.70 8.66 8.64 8.62 859 8,58 857 855 8,55 854 853 853| 95 
269 26.7 266 265 264 264 263 262 262 262 261 261 | .99 
2.08 2.08 2.08 2.08 2.08 2.08 2,08 208 2.08 2.08 2,08 208| .75 
387 3.84 3.83 3.82 3.80 340 379 3.78 378 3.77 3.76 3.76| 90 
5.86 5.80 577 575 5.72 570 5.69 5.66 5.66 5.65 564 563| 95 | 4 
142 140 139 138 13.7 13.7 13.7 136 13.6 135 13.5 13.5 | 99 
1.89 1.88 1.88 1.88 1.88 1.88 1.87 1.87 1.87 187 187 187| .75 
324 321 319 317 316 315 314 343 3.12 3.12 3.11 310| 90 | 5 
| 4.62 4.56 4,53 450 446 444 443 441 440 439 437 436| .95 
972 9.55 947 938 929 924 920 9.13 9.1] 908 9.04 9.02| 99 
176 1.76 175 1.75 1.75 145 1.74 174 174 L74 174 174| Л5 
287 2,84 2.82 280 2.78 2.77 2.6 275 2.74 2.73 2.73 272| 90 | 6 
3.94 3.87 3.4 3.81 3.77 3.75 3.74 3.71 3.70 3.69 3.68 3.67) 95 
756 740 731 723 714 7.09 7.06 699 697 693 690 688| 99 
1.8 1.67 1.67 1.66 1.66 1.66 1.65 1.65 1.65 1.65 1.65 1.65) .75 
2.63 259 258 256 2.54 2.52 251 250 249 248 248 247| 90 | 7 
3.51 344 341 338 334 3.32 330 327 327 325 324 323| .95 
631 616 607 599 591 586 582 575 5.74 570 $67 5.65) 99 
162 1.61 1.60 1.60 1.59 1.59 1.59 1,58 1,58 1.58 1,58 1,58) .75 
246 242 240 238 236 235 234 232 232 231 230 229| 90 | 8 
322 3.15 312 308 3,04 302 301 297 297 295 294 293| .95 
552 536 528 520 512 507 5.03 496 495 491 4.88 486| .99 
55 154 1,4 1,53 1,53 153 153 1.53) .75 
2x due 1 12 221 219 218 2.7 247 216| 90 | 9 
2.34 230 228 225 223 222 2 
301 294 290 286 283 280 279 276 275 273 272 211) 95 
496 481 473 465 457 452 448 442 440 436 433 431) . 
80 150 149 149 149 148 148| 75 
YR Л: Б. 130 Уп 209 208 207 206 206| % | 10 
i ў | 1 66 264 262 2.59 2.58 2.56 2.55 254| .95 
285 277 2.74 2.70 2.66 2.64 2. rue T ollie 
456 441 433 425 417 412 408 401 400 396 393 3.91) . 

47 146 146 146 145 145| 75 
цю тае ie ne ы 18 18 15 18 18 18 12 mon 
272 265 261 257 253 251 249 246 245 243 242 240| 45 
425 410 402 394 3.86 381 3.78 3.71 3.69 3.66 3.62 3.60) . 

44 143 143 143 142 142| .75 
148 147 145 M NIST KH 194 193 1.92 1.91 1.90} .90 12 
210 206 204 201 199 197 196 194 19 132 231 230) 95 
262 254 251 247 243 240 238 235 234 242 Зза 336| 39 


3,57 354 347 
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Table B.3 
df for numerator 
df for a 

denom. 1 2 3 4 5 6 7 8 9 10 11 12 
Лэ 1.45 1.54 1.54 1.53 1.52 1.51 1.50 1.49 1.49 1.48 1.47 1.47 

13 :90 | 3.14 2.76 2.56 243 2.35 2.28 223 220 216 214 212 210 
95 | 4.67 3.81 341 3.18 3.03 292 283 277 271 2.67 2.63 2.60 

199 | 9.07 6.70 5.74 5.21 4.86 4.62 444 430 4.19 4.10 4.02 3.96 

‚75 1.44 1.53 1.53 1.52 1.51 1.50 1.48 1.48 1.47 1.46 1.46 1.45 

14 90 | 3.10 273 252 239 231 224 219 215 2.12 210 2.08 2.05 
95 | 4.60 3.74 334 3.11 2.96 2.85 2.76 2.70 2.65 2.60 2.57 2.53 

.99 | 8.86 6.51 5.56 5.04 4.69 446 428 414 4.03 3.94 3.86 3.80 

75 143 1.52 1.52 1.51 1.49 1.48 1.47 1.46 1.46 1.45 1.44 144 

15 90 | 3.07 270 249 236 227 221 2.16 212 209 206 2.04 2.02 
95 | 4.54 3.68 3.29 3.06 2.90 2.79 2.71 2.64 2.59 254 2.51 2.48 

.99 | 8.68 6.36 5.42 489 4.56 4.32 414 400 3.89 3.80 3.73 3.67 

19. 142 1.51 1.51 1.50 1.48 1.48 147 1.46 1.45 1.45 1.44 1.44 

16 :90 | 3.05 2.67 2.46 2.33 224 2.18 2.13 209 2.06 2.03 2.01 1.99 
95 | 4.49 3.63 3.24 3.01 2.85 2.74 2.66 2.59 254 2.49 246 242 

.99 | 8.53 623 529 4.77 4.44 420 403 3.89 3.8 3.69 3.62 3.55 

15 142 1.51 1.50 1.49 1.47 1.46 1.45 1.44 1.43 143 1.42 141 

17 90 | 3.03 2.64 244 231 2.22 245 210 2.06 2.03 2.00 1.98 1.96 
95 | 445 3.59 320 2.96 281 2.70 2.61 2.55 2.49 245 2.41 2.38 

99 | 840 611 5.18 4.67 434 410 3.93 3.79 3.68 3.59 3.52 3.46 

75 141 1.50 149 148 146 145 144 143 142 142 141 140 

18 90 | 301 2.62 242 229 220 2.13 2.08 2.04 2.00 1.98 1.96: 1.93 
95 | 441 3.55 3.16 293 277 2.66 2.58 2.51 2.46 2,41 52.37 2.34 

99 | 8.29 601 5.09 4,58 425 401 3.84 3.71 3.60 3.51 3.43 3.37 

:75 141 1.49 1.49 1.47 1.46 1.44 1.43 142 1.41 1.41 1.40 1.40 

19 90 | 299 2.61 240 227 218 211 2.06 2.02 1,98 1.96 1.94 1.91 
95 | 4.38 3.52 3.13 2.90 2.74 2.60 2.54 248 2.42 2.38 2.34 2.31 

99 | 818 593 501 450 417 394 3.77 3.63 3.52 343 336° 13/30) 

75 | 1.40 1.49 1.48 146 1.45 1.44 1.42 142 141 1.40 1.39 1.39 

20 :90 | 2.97 2.59 238 225 2.16 2.09 2.04 2.00 1.96 1.94 1.92 1.89 
95 | 4.35 3.49 310 287 2.1 2.60 251 245 2/89: 82:35 2931 2.28 

99 | 810 5.85 494 443 410 387 3.70 3.56 346 3:37. 3.29 3:23 

‚75 1.40 1.48 1.47 145 1.44 142 141 140 1.39 1.39 1.38 1.37 

22 90 | 295 2.56 235 222 243 206 201 197 193 1.90 1.88 1.86 
95 | 4.30 3.44 3.05 2.82 2.66 2.55 246 240 2.34 2.30 226 223 

99 | 7.95 5.72 482 431 3.99 3.76 3.59 345 3.35 326 3.18 3.12 

18 139 147 146 144 143 141 140 1.39 1.8 1.38 137 41:26) 

24 90 | 293 2.54 233 219 210 204 1.98 194 191 1.88 1.85 1.83 
95 | 426 340 3.01 278 2.62 2.51 242 236 2.30 225 221 2.18 

99 | 7.82 5.61 472 422 390 3.67 3.50 3.6 3.26 317 3.09 3.03 

15, 1.38 1.46 1.45 1.44 142 141 1.40 1.39 137 1:337 £1.36. 71:35 

26 0 |291 252 231 217 2.08 2.01 1.96 1.92 1.88 1.86 1.84 1.81 
95 | 423 3.37 2.98 2.74 2.59 247 239 232 227 222 248 215 

ЭЭ CLIA 5:53 ДЫ 414 3.82 3.59 3.42 3.29 3.18 3.09 3.02 2.96 

75 | 1.38 146 145 143 141 140 1.39 1.38 137 1.36 1.35 1.34 

28 .90 | 2.89 2.50 229 216 2.06 2.00 1.94 1.90 1.87 1.84 1.81 1.79 
95 | 420 334 295 271 2.56 245 236 229 224 12319 275 212 


99 | 7.64 545 457 407 3.75 3.53 3.36 3. 3.03 2.96 2.90 


F Distribution (Continued)* 


TABLES 


645 


df for numerator 


1—«| df for 
15 20 24 30 40 50 60 10 120 200 500 о denom. 
146 145 144 143 142 142 142 141 141 140 140 140 | .75 
205 201 198 196 193 192 190 1.88 188 146 1.85 1.85 | .90 | 13 
253 246 242 238 234 231 230 226 225 223 222 221 | .95 
382 3.66 359 351 343 338 334 327 325 322 3.19 317 | .99 
144 143 142 144 141 140 140 1.39 139 139 138 138 | .75 
201 196 194 191 189 187 186 183 1.83 1.82 180 180 | .90 
246 239 235 231 227 224 222 219 218 216 214 213 | 95 | 14 
366 351 343 335 327 322 318 3.1 309 306 303 3.0 | .99 
143 141 141 140 139 139 138 138 137 137 136 136 | .75 
197 192 190 187 185 183 182 1.79 179 177 1.76 176 | 90 | 15 
240 233 229 225 220 218 216 212 211 210 208 207 | 95 
352 337 329 321 313 308 3.05 298 296 292 289 287 | .99 
141 140 139 138 137 137 136 136 135 135 134 134 | 75 
194 189 187 184 181 179 178 176 175 174 173 172 | 90 | 16 
235 228 224 219 215 212 211 207 206 2.04 202 201 | .95 
341 326 318 310 302 297 293 246 284 281 278 275 | .99 
140 139 138 137 136 135 135 134 134 134 133 133 | 75 
191 186 184 151 178) ІЛ6 175) 2:79 172 ШІ ШО 19) 90 17 
231 223 219 215 210 208 206 202 201 199 197 196 | 95 
531 316 308 300 292 287 2.89 2.76 275 271 268 2,65 | .99 
1.39 137 136 135 134 134 133 133 132 132 132 | 75 
19 Lu ІШ 1 1:5 174 10 170 69 168 167 160) 50 18 
12» 239 215 211 020502040202. (198 197 195 01230 1.92409 
121 308 300 292 284 278 275 268 266 262 259 207 | 99 
5 
iss A31 435 135 А аз аз alta? МОУ ed nage? 
1998 4 зеи оз ИЛАШ ali ШШК RA GRO 12 
15 Xie 211207 20502000228 194 19-01 ее 5 
222 210 252 284 276 270 061 060 098) 2957 ио 
33 132 131 131 130 130 129 | 75 
L3 136 135 134 139 139 132 ішін 10 К deil so | ж 
1834 179 1.77 174 121 1.69 168 1l. s es | 
1200 212 208 20 199 197 195 191 190 148 Шоо 
To 94 286 278 200m 264 Lal 28 олары аа |с 
136 124. E 1522 stead TO 1909 15006018004 И 128 and 
E MEC Төс ТЕЕ О УКК merase 
й 107 2) 189 185 184 1.82 180 178 | 95 
215 207 203 198 1.94 191 1.89 1. m me 
235 ix Q5 207925082558 аи шу із со шк. 
128 128 127 127 126 | .75 
135 133 132 131 120 1% n E 19 156 154 153| .90 | 24 
178 173 170 1.67 1.64 1.62 161 1 2 De M In|s 
Iu 203 198 194 189 186 144 180 LD 107 024 zn | s 
T. 27% 206 2158 240 2/4 240-229) 2а ЫШ 
26 126 125 125 | 75 
134 122 131 130 129 12 5 135 138 153 1.51 1:50 | 90 | 26 
176 1л 158 165 161 159 138 1% 125 jm үл 169 | 95 
Zo) 139 195 190 185 182 180 176 195 Ti 216 213 | 99 
281 256 258 2001 2428 2аб) 253-254 
25 125 124 124 | 75 
133 131 130 129 128 127 127 126 18 145 1% 18 % 2 
i74 16 165 ША ПЕ) LS Мыл ae МОР 165 |79 
152: 5179 ІЙ ӨЛІ ЛІ 169 145 | 5 


2.04 


1.96 1.91 1.87 


235 260 252 244 235 219 2. 
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Table B.3 
df for numerator 
df for |1 —« — 
denom. 1 2 3 4 5 6 7 8 9 10 11 12 

15 | 1.38 145 144 142 141 1.39 1.38 1.37 1.36 1.35 1.35 1.34 

30 90 | 2.88 249 228 214 2.05 1.98 1.93 1.88 1.85 1.82 179 1.77 
95 |417 3.32 292 269 2.53 2.42 2.33 227 221 216 2.13 209 

99 | 7.56 5.39 4.51 402 3.70 347 3.30 3.17 3.07 2.98 2.91 2.84 

‚75 136 144 142 140 139 137 1.36 1:35 104 4133 132 1131 

40 90 | 2.84 244 223 209 2.00 1.93 1:87: 1584 179) 1076 1773. 1.71 
95 | 4.08 323 2.84 2.61 245 234 225 218 2.12 2.08 2.04 2.00 

99 | 7.31 5.18 431 3.83 3.51 329 312 299 289 2.80 2.73 2.66 

75 1.35 142 141 138 1.37 135 133 1.32 1.31 1.30 129 1.29 

60 90 | 2.79 2.39 218 2.04 1.95 1.87 1.82 1.77 1.74 1.71 1.68 1.66 
95 | 4.00 3.15 2.76 253 237 225 217 210 2.04 1.99 1.95 1.92 

99 | 7.08 4.98 413 3.65 3.34 312 295 282 2.72 2.63 2.56 2.50 

75 | 134 1.40 1.39 137 1.35 1.33 1.31 130 129 128 1:27 1.26 

120 90 | 2.75 2.35 2.3 1.99 1.90 1.82 177 1.72 1.68 1.65 1.62 1.60 
95 | 3.92 3.07 2.68 245 229 217 2.09 2.02 1.96 1.91 1.87 1.83 

99 | 6.85 4.79 395 348 3.17 2.96 2.79 2.66 2.56 2.47 2.40 2.34 

«15 133 1.39 1.38 1.36 1.34 1.32 131 129 128 127 126 125 

200 90 |273 233 211 1.97 1.88 1.80 1.75 1.70 1.66 1.63 1.60 1.57 
.95 | 3.89 3.04 2.65 2.42 2.26 2.14 2.06 1.98 1.93 1.88 1.84 1.80 

:99 | 6.76 471 3.88 341 311 2.89 2.73 2.60 2.50 241 2.34 227 

‚75 92 71439: 3837 1135 1:33 131 1.29 128 127 125 124 124 

© 90 | 2.71 230 208 1.94 185 1.77 172 1.67 1.63 1.60 1.57 1.55 
95 | 3.84 3.00 2.60 2.37 221 2.10 2.01 1.94 1.88 1.83 1.79 1.75 

49 | 663 4.61 3.78 332 3.02 2.80 2.64 2.51 241 2.32 2.25 2.18 


* This table is abridged from Table 18 in Biometrika Tables for Statisticians, vol. 1. 
Reproduced with the kind permission of E. S. Pearson and the trustees of Biometrika. 


TABLES 647 


F Distribution (Continued)* 


df for numerator 


df for 
15 20 24 30 40 50 60 100 120 200 500 о |1-л 4епот. 
132 130 129 1.28 127 126 126 125 124 124% 129% 1937) 175 
172 1.67 1.64 1.61 1.57 1.55 1.54 1.51 1.50 148 1.47 1.46 | .90 30 
201 193 1.89 1.84 1.79 1.76 1.74 1.70 1.68 1.66 1.64 1.62 | .95 
270 255 247 239 230 225 221 2.13 2.11 207 203 201 | .99 
130 128 126 125 124 123 122 121 121 120 119 119 | .75 
1.66 1.61 1.57 1.54 151 148 147 143 142 141 1.39 1.38 | .90 40 
192 1.84 1.79 1.74 1.69 1.66 1.64 1.59 1.58 1.55 1.53 1.51 | .95 
252 237 229 220 211 2.06 2.02 194 192 1,87 1.83 1.80 | .99 
127 125 124 122 121 120 119 117 117 L16 115 1.15 | .75 
1.60 1.54 1.51 148 1.44 1.41 1.40 136 1.35 133 131 1.29 | .90 60 
184 175 1.70 1.65 1.59 1.56 1.53 148 147 144 14! 1.39 | .95 
235 220 212 2.03 1.94 1,88 184 1.75 1.73 1.68 1.63 1.60 199 


124 122 121 119 118 147 116 114 113 112 111 140 | .75 
1.55 148 145 141 1.37 134 132 127 126 124 121 19 | 90 | 120 
175 1.66 1.61 1.55 1.50 146 143 137 135 132 128 1.25 | .95 
2.49 203 1.95 1.86 1.76 1.70 1.66 1.56 1.53 1.48 142 1.38 | .99 
123 121 120 118 1.16 114 112 111 110 109 108 1,06 | 75 
1.52 рді 142 1.38 134 ІЗІ 128 124 122 120 117 1.14 | 90 | 200 
172 1.62 157 152 146 141 1,9 1.32 129 126 122 1.19 | 95 
213 1.97 1.89 1.79 1.69 1.68 1.58 148 144 139 1.33 1.28 | .99 
1.3 1.12 1.09 1.08 107 104 1.00 | .75 
149 142 oe 126 124 118 117 113 108 1,00 | .90 о 
1.67 157 1.52 135 132 124 122 117 111 1.00 95 
152 147 1.36 132 1.25 1.15 1.00 99 


204 1.88 1.79 


(24 ed.) New York: Cambridge, 1958. Edited by E. S. Pearson and Н. O. Hartley 
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Table B.5 Arcsin Transformation ($= 2 arcsin УХ) 


021.2909 | .16 8230 | .56 1.6911 | .951 2.6952 991 2.9516 


025 .3176 | 20 .9273 | 60 1.7722 | 955 27141 | 995 30001 
26 .3239 | 21 .9521| 61 1.7926 | 956 2.7189 | 996 30150 


029 .3423 | 24 1.0239 m 1.8546 1959 2.7338 999 3.0783 


TABLES 


Table B.6 Distribution of ¢ Statistic in Comparing Treatment 


Means with a Control 


651 


defor We К = number of means (including control) 
MSerror 2 3 4 5 д E Я 5 t 
95 | 202 244 268 285 29 
5 |:975 | 257 303 339 3.66 auci i P H3 
%9 | 337 390 421 443 460 473 485 494 503 
‘995 | 403 46 5.09 544 5.73 5.97 618 636 65 
95 | 194 234 256 271 28% 2 
6 | 97 | 245 286 318 341 3.60 $25 388 im ad 
99 | зла 361 388 407 421 433 443 451 45 
1995 | 371 422 460 488 51 530 547 561 544 
95 | 1.89 ол 248 262 27 28 289 295 3401 
7 | 1975 | 236 2.75 3.04 3.24 341 3.54 3.66 376 386 
09 | 300 342 366 383 396 407 415 423 430 
‘905 | 350 395 428 452 40 487 501 513 524 
95 | 186 22 242 255 26 274 28 287 292 
в | 97 231 267 294 313 328 340 351 360 368 
99 | 290 39 35] 360 379 388 396 403 409 
995 | 336 377 406 427 444 458 470 481 4% 
95 | 183 218 237 250 20 268 275 281 286 
o. 915 | 2251 21619 02358 З 180 249: 3402 238. X 252 
952 | 282. 319. 340 355. 346 3075. 382 389 39 
395 | 325 316 39 49 424 47 448 451 46 
os.| i31. 2138 200 24, 256 эы 2201 1216, 281 
о 155 Ж 29 ШЕ З ӨЛЕ За 1352) 546 
АСАН 4555 57 SM 13218. 1328. 1 947 
eel su ASQ зове 59% ЖШ 401 4515 1480. 487 
ss.| ns» br 2319 0238 (2338 2601: 2414 [2:5 2.17 
и | ‘575 | 220, 2:30 ЛЫ 2900 ЭШ 210 24 331 3 
‘99 | 272 306 325 338. 348 356 3.63 39 зм 
995 | 311, 345. 348. 3850 328. 4 4. : і 
ИТ 2р 28! 26 2.69 2/4 
ь | уш Жи vols uS ет 325 $3 
3$) | Ze soi з 382 заң ID) за 362 3.67 
dos gos; 339] 361) ОЛЫ 349p 399) АМ 4IS 422 
as bam zy eT 28% 2 25 266 211 
87106 216. 240 2000 RO 296 gaa sap. 327 
Heo p y x E: 356 3.61 
395; |. 3015 | 3823 05/549 ық ИЛЕ «гі 399 406 413 
9 
ПЕТ а oe 259 264 2.6 
КЮ КОЕ О cui verde 53 EUR xo 5/7 323 
995 | воо ОИ 240 ПЕ а-ы 346 351 3.56 
ТЫВ ао 399 405 


652 APPENDIX B 


Table B.6 Distribution of ¢ Statistic when Comparing Treatment 
Means With a Control (Continued)* 


К = number of means (including control) 


df for |1 —« 
MSerror 2 3 4 5 6 7 8 9 10 


95 1.75 2.06 2.23 2.34 2.43 2.50 2.56 2.61 2.65 
16 .975 2.12 2.42 2.63 2.77 2.88 2.96 3.04 3.10 3.16 
,99 2.58 2.88 3.05 zB UE 3.26 3.33 3.39 3.44 3.48 
2995 2.92 3.22 3.41 3.55 3.65 3.74 3.82 3.88 3.93 


.95 1.73 2.04 221 2.32 2.41 2.48 2.53 2.58 2.62 
18 5972 2.10 2.40 2.59 2.73 2.84 2.92 2.99 3.05 3.11 
.99 2.55 2.84 3.01 3.12 3.21 3.27 3.33 3.38 3.42 
.995 2.88 3.17 3.35 3.48 3.58 3.67 3.74 3.80 3.85 


195 1.72 2.03 219 2.30 2.39 2.46 2,51 2.56 2.60 
20 .975 2.09 2.38 2.57 2.70 2.81 2.89 2.96 3.02 3.07 
.99 2.53 2.81 2.97 3.08 3.17 3.23 3.29 3.34 3.38 
995 2.85 3.13 3.31 3.43 3.53 3.61 3.67 3.73 3.78 


:95 1.71 2.01 2.17 2.28 2.36 2.43 2.48 2.53 2.57 
24 .975 2.06 2.35 2.53 2.66 2.76 2.84 2.91 2.96 3.01 
.99 2.49 2.77 2.92 3.03 3.11 3.17 3.22 3.27 3.31 
.995 2.80 3.07 3.24 3.36 3.45 352 3.58 3.64 3.69 


95 1.70 1,99 215 2.25 2.33 2.40 2.45 2.50 2.54 
30 ‚975 2.04 2.32 2.50 2.62 2.72 2.79 2.86 2.91 2.96 
.99 2.46 2.72 2.87 2.97 3.05 3.11 3.16 3.21 3.24 
.995 2.75 3.01 3.17 3.28 3.37 3.44 3.50 3.55 3.59 


.95 1.68 1.97 2.13 2.23 2.31 2.37 2.42 2.47 2.51 
40 975 2.02 2.29 247 2.58 2.67 2.75 2.81 2.86 2.90 
99 2.42 2.68 2.82 2.92 2.99 3.05 310 3.14 3.18 
995 2.70 2.95 3.10 3.21 3.29 3.36 3.41 3.46 3.50 


95 1.67 1.95 2.10 2.21 2.28 235 2.39 2.44 2.48 
60 975 2.00 2.27 2.43 2.55 2.63 2.70 2.76 2.81 2.85 
199 2.39 2.64 2.78 2.87 2.94 3.00 3.04 3.08 3.12 
.995 2.66 2.90 3.04 3.14 3.22 3.28 3.33 3.38 342 


95 Hesa 14931. 2084-2382 296. 20 25371 124! 245 
120 |.975 | 198 224 240 251 259 2166 271 276 280 

99 236 260 2.73 2.82 2.89 294 299 3.03 3.06 
1995 262 284 298 308 315 3219 7 325* 330 333 


permission of the author, C. W, Dunnett, and the editor. 
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Table B.7 Distribution of Fmax Statistic* 


af fon |Н Ех К = number of variances 
з 
s 2 3 4 5 6 7 8 9 10 
4 | .95 | 960 155 206 252 295 336 375 414 44 
199 | 232% ат (9149298 (8-59 Бо 8192 we 89.59 от 106. 
s | 95 | 745 51087. 130!) 163 (0087 04, (229) 240) 26% 
39 | 149. 222. 2220 7 ЕМС Ж 
6 |.95| 582 838 104 121 137 150 163 175 186 
ә | ita ass Ол Uno Vy! 20 32 M 
7 | 95 | 499 694 844 970 108 118 127 135 143 
‘99 | 889 121 145 165 184 20, 22 2. 24, 
з | 95 | 443 69 78 812 903 978 105 14 117 
‘99 | 750 99 117 132 145 15.8 169 179 189 
9 | 95 | аз 534 631 711 780 841 895 945 991 
19. 464 КЕЗ TA То “ІЗІ” 159 157.) 153 
10 | 95 | 3.72 485 5.67 634 692 742 747 828 8.66 
99 | 585 74 86 96 104 14 118 124 129 
р | 95 | 328 416 49 530 572 69 642 6.72 700 
ШЕКТЕСТІ: Жучу е Od dea 95 139 
i15 | 95 | 286 354 40 437 468 495 S19 540 59 
92) 07 "MAD БЕЗНЕ GOR TEA ОЛЫП TI "mn. 712 
ж | ss | 246 295 39 354 7916 1994. 410 424 437 
$9 | 3324 28 | 22 RC pectin 534154.) 59 
307 | Lor 5240 1250 9 e ОЗІ 191902177 31272 222-749 
Е 30 00422 И И s 039. 1 40 
во 1.95]. 1615155 ов. АШЫП Ыш |2222) 225: 44540 
Jel tos 025: 250045 2320004 НЕДИ: 26, 725 
© 100 '* 1,00 100 19770 3200 ^ 300 — 100. 100 
9510 (о 19 100 0 — 100 100 100 


* This table is abridged from Table 31 in Biometrika Tables for Statisticians, vol. 1. 


(2d ed.) New York: Cambridge, 1958. Edited by E. S. Pearson and H. O. Hartley. 
Reproduced with the kind permission of E. S. Pearson and the trustees of Biometrika. 
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TABLES 655 
Table B.9 Chi-square Distribution* 


df Percentile point 

50 75 90 95 97.5 99 99,5 

1 46 13 27 3.8 5.0 6.6 
s : 7.9 
2 14 2.8 4.6 6.0 74 92 10.6 
3 24 41 6.3 7.8 94 11.3 12.8 
4 34 54 7.8 9.5 11.1 13.3 14.9 
5 44 6.6 92 11.1 12.8 15.1 16.7 
6 5.4 7.8 10.6 12.6 14.4 16.8 18.5 
7 6.4 9.0 12.0 14.1 16.0 18.5 20.3 
8 13 10.2 13.4 15.5 17.5 20.1 220 
9 8.3 11.4 14.7 16.9 19.0 21.7 23.6 
10 9.3 12.5 16.0 18.3 20.5 23.2 25.2 
11] 103 13.7 17.3 19.7 21.9 24.7 26.8 
12| 11.3 14.8 18.5 21.0 233 26.2 28.3 
13] 123 16.0 19.8 22.4 24.7 277 29.8 
14| 133 174 21.1 237 26.1 29.1 313 
15| 143 18.2 223 25.0 27.5 30.6 32.8 
16| 153 19.4 23.5 26.3 28.8 32.0 34.3 
17| 163 20.5 24.8 27.6 30.2 33.4 35.7 
18] 173 21.6 26.0 28.9 31.5 34.8 37.2 
19| 183 22.7 27.2 30.1 32.9 36.2 38.6 
20| 193 23.8 28.4 31.4 34.2 37.6 40.0 
2110203 24.9 29.6 32.7 35.5 38.9 41.4 
22| 213 26.0 30.8 33.9 36.8 40.3 42.8 
23| 223 274 32.0 35.2 38.1 41.6 44,2 
24] 233 28.2 33.2 36.4 39.4 43.0 45.6 
25| 243 29.3 34.4 37.7 40.6 44.3 46.9 
26| 253 30.4 35.6 38.9 41.9 45.6 48.3 
Эт e 31.5 36.7 40.1 43.2 47.0 49.6 
285273 32.6 379 41.3 44.5 48.3 51.0 
29 | 283 33.7 39.1 42.6 45.7 49.6 52.3 
30| 293 34.8 40.3 43.8 47.0 50.9 53.7 
40| 393 45.6 51.8 55.8 59.3 63.7 66.8 
60| 593 67.0 744 79.1 833 884 92.0 
100| 993 109.1 118.5 124.3 129.6 135.8 140.2 


For df > 30, 72-4 < EV A(df) — 1 + z 42. 
For example, when df — 60, 
ys = [V A60) — 1 + 1.645)/2 = 79. 


Р ; , 4 Ж д rere val. d. 
* This table is abridged from Table 8 in Biometrika Tables for Statisticians, Vol. 
(2d ed.) New York: Cambridge, 1958. Edited by E. S. Pearson and H. O. Hartley. 
Reproduced with the kind permission of E. S. Pearson and the trustees of Biometrika. 
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Table B.10 Coefficients of Orthogonal Polynomials 


k | Polynomial] Х-1 2 3 4 5 6 7 8 9 10 À 
3 | Linear —1 0 1 211 
Quadratic 1 -2 1 613 
Linear 528 (LL 1 3 20 | 2 
4 | Quadratic 1 -1 1 4 |1] 
Cubic _1 з —3 1 20 | 1% 
Linear -2 -1 0 1 2 10] 1 
5 | Quadratic 2 -І -2 — 2 14] 1 
Cubic -1 2 0 —2 1 10% 
Quartic 1 —4 6 —4 1 70 |а 
Linear -5 -3 -і 1 3 5 70| 2 
6 | Quadratic 5 0 Ine -4: =e =} 5 84 | % 
Cubic =$ 7 4 -4 -7 5 180 | 54 
Quartic 1 -3 2 2 —3 1 28 | 8 
Linear —3 =2 —1 0 1 2 3 28| 1 
7 | Quadratic 5 0-3 -4 — 0 5 84| 1 
Cubic —1 1 1 0 -1 ~ 1 6 | % 
Quartic 3 —7 1 6 1 —7 3 154 | 9s 
Linear EIN. 1 3 5 Т, 168 | 2 
Quadratic yl 19 3 SM. VES 1 7 168 | 1 
8 | Cubic —7 5 7 3508.3. 218 4 7 264 | % 
Quartic 7 —13 -3 9 9 —3 —13 7 616 | He 
Quintic 7 23 —17 —15 15 17 —23 7 2184 | Ио 
Linear =4 © a 7 0 1 2 3 4 60] 1 
Quadratic 28 7 —8 —17 —20 —17 —8 7 28 2772 | 3 
9 | Cubic —14 7 13 9 0 59 -13 =7 14 990 | % 
Quartic 14 —21 —11 9 18 9 —1 —21 14 2002 | 7 
Quintic —4 П —4 -9 0 9 4 —11 4 468 | 3% 
Linear 2 РЕСЕ. =j 1 3 5 T 9 330 | 2 
Quadratic 6 2 1 3 4 —4 3 1 2 6 132 | 14 
10 Cubic —42 14 35 31 12 —12 —31 —35 —14 42 | 8580 | 5 
Quartic 18 —22 —17 2 18 18 3 -17 —22 18 | 2860 | 5% 


Quintic —6 14, —1 = 356 6 11 1 —14 6]| 780 |: 
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Table B.11 Curves of Constant Power for Tests on 
Main Effects 


L a. Т ӘЛЕ is De жр, a 12 1 
10 0.20 030 040 050 0.60 0.70 < for a= 0.05 
fora=0.01—> 020 0.30 040 050 0.60 


0.50 0.60 | 070-—for a= 0.05 $ 


0 
0.10 020 030 0.40 
for к= 0.01 — 0.15 0.20 030 040 050 0.60 070 
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Table B.11 Curves of Constant Power for Tests on 
Main Effects (Continued)* 


100 


80 


20 


=l ЕЕ 1 = | zi L E 
%0 0.20 0.30 0.40 0.50 0.60 0.70 «— for «=0.05 
for 2=0.01— 0.15 0.20 0.30 0.40 0.50 0.60 0.70 


k=5 
© 
осе е ме | | ] ae JE 
005101090050 20 9.090 ЧӨП SC 1050777 га 005 2 


Ф 
for a=0.01—> 0.15 0.20 0.30 0.40 0.50 0.60 0.70 


* Reproduced from L. S. Feldt and M. W. Mahmoud, Power function charts for 


Specification of sample size in analysis of variance. Psychometrika, 1958, 23, 201-210, 
with permission of the editor. 


Table B.12 Random Permutations of 16 Numbers 
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Table B.12 Random Permutations of 16 Numbers (Continued) 
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